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Preface 


The timing of à school on Heavy Flavour Physics in August 2001 was particularly 
fortunate. Both BaBar and Belle had reported in the previous month new results with 
clear evidence for CP-violation in B4 decays. The Standard Model had been subjected 
to a qualitatively and quantitatively new test, and passed with flying colours. The result 
on sin 28 ~ 0.7 + 0.2 was in excellent agreement with the Standard Model expectations 
based on non-CP violating processes of sin 28 of about 0.7. The CP-violation observed 
more than 35 years ago in the neutral kaon system was confirmed as coming, at least 
predominantly, from the complex phase in the quark mixing matrix. 


Of course, there is more, much more, to Heavy Flavour Physics than CP violation 
and the b-quark systems, and the course of lectures reflected this—'heavy flavour' was 
interpreted generously to include the c-quark and as appropriate the s-quark. 


The subject was approached from a strongly phenomenological point of view. Jonathan 
Rosner gave an overview of the Standard Model and its development, Gerhard Buchalla 
introduced the topic of Heavy Quark Theory and Christine Davies explained how these 
ideas could be implemented on the lattice. Yosef Nir gave a thorough introduction to 
CP-violation, while Steve Abel introduced concepts beyond the current Standard Model. 
Sheldon Stone gave an overview of B-phenomenology, and the present and future exper- 
imental programme was reviewed by Klaus Schubert (B-factories), Peter Krizan (HERA 
and the Tevatron), and Tatsuya Nakada (LHCb and BTeV). Harry Nelson discussed the 
opportunities for observing physics beyond the Standard Model in the charm sector, and 
Konrad Kleinknecht discussed the status of CP-violation in the neutral kaon system. Two 
topics strictly outside the domain of heavy flavour, but with significant relevance were 
covered. Lawrence Krauss gave an introduction to cosmology, and Ken Peach discussed 
the possibility that CP-violation might one day be measurable in the neutrino sector at 
a neutrino factory. 


We believe that this series of lectures will provide a thorough introduction to the 
phenomenology of heavy flavour physics, not only those working on the B-factories, LHCb 
or BTeV, but also those working on HERA, the Tevatron and eventually the LHC general 
purpose detectors. While the wealth of data from the B-factories and the Tevatron will 
lead to new discoveries and, we hope, clues to the physics beyond the Standard Model, it 
is already clear that the model provides a very good description of the data. Meanwhile, 
these lectures will be a useful guide to these processes, and will enable the student, 
whether theorist or experimentalist, to judge the significance of these developments as 
they unfold. 

Summer schools are not just about science: they are also about dialogue, discussion, 
meeting people, and making friends. The school succeeded in this secondary aim, aided 
by full social programme and a friendly environment provided by the staff of the John 
Burnet Hall. Within this, the scientific discussions and personal interactions flourished. 
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The Standard Model in 2001 


Jonathan L Rosner 


Enrico Fermi Institute and Department of Physics 


University of Chicago, USA 


DOI: 10.1201/9780429187056-1 


1 Introduction 


The "Standard Model" of elementary particle physics encompasses the progress that has 
been made in the past half-century in understanding the weak, electromagnetic, and 
strong interactions. The name was apparently bestowed by my doctoral thesis advisor. 
Sam Treiman, whose dedication to particle physics kindled the light for so many of his 
students during those times of experimental and theoretical discoveries. These lectures 
are dedicated to his memory. 


As graduate students at Princeton in the 1960s, my colleagues and I had no idea of the 
tremendous strides that would be made in bringing quantum field theory to bear upon 
such a wide variety of phenomena. At the time, its only domain of useful application 
seemed to be in the quantum electrodynamics (QED) of photons, electrons, and muons. 


Our arsenal of techniques for understanding the strong interactions included analyt- 
icity, unitarity, and crossing symmetry (principles still of great use), and the emerging 
SU(3) and SU(6) symmetries. The quark model (Gell-Mann 1964, Zweig 1964) was just 
beginning to emerge, and its successes at times seemed mysterious. The ensuing decade 
gave us a theory of the strong interactions, quantum chromo-dynamics (QCD), based 
on the exchange of self-interacting vector quanta. QCD has permitted quantitative cal- 
culations of a wide range of hitherto intractable properties of the hadrons (Lev Okun's 
name for the strongly interacting particles), and has been validated by the discovery of 
its force-carrier, the gluon. 


In the 1960s the weak interactions were represented by a phenomenological (and un- 
renormalisable) four-fermion theory which was of no use for higher-order calculations. 
Attempts to describe weak interactions in terms of heavy boson exchange eventually 
bore fruit when they were unified with electromagnetism and a suitable mechanism for 
generation of heavy boson mass was found. This electroweak theory has been spectacu- 
larly successful, leading to the prediction and observation of the W and Z bosons and 
to precision tests which have confirmed the applicability of the theory to higher-order 
calculations. 
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In this introductory section we shall assemble the ingredients of the standard model 
—the quarks and leptons and their interactions. We shall discuss both the theory of the 
strong interactions, quantum chromo-dynamics (QCD), and the unified theory of weak 
and electromagnetic interactions based on the gauge group SU(2) & U(1). Since QCD is 
an unbroken gauge theory, we shall discuss it first, in the general context of gauge theories 
in Section 2. We then discuss the theory of charge-changing weak interactions (Section 3) 
and its unification with electromagnetism (Section 4). The unsolved part of the puzzle, 
the Higgs boson, is treated in Section 5, while Section 6 concludes. 


These lectures are based in part on courses that I have taught at the University of 
Minnesota and the University of Chicago, as well as at summer schools (e.g., Rosner 1988, 
1997). They owe a significant debt to the fine book by Quigg (1983). 


Quarks Leptons 
Q-2/3 Q--1/3 Q--1 Q-0 


logio [M/(1 GeV/c?)] 


{ 
peupepepmepp 


Figure 1. Patterns of charge-changing weak transitions among quarks and leptons. The 
strongest inter-quark transitions correspond to the solid lines, with dashed, dot-dashed, 
and dotted lines corresponding to successively weaker transitions. 


1.1 Quarks and leptons 


The fundamental building blocks of strongly interacting particles, the quarks, and the 
fundamental fermions lacking strong interactions, the leptons, are illustrated, along with 
their interactions, in Figure 1. Masses, as quoted by the Particle Data Group (2000), 
are summarised in Table 1. The relative strengths of the charge-current weak transitions 
between the quarks are summarised in Table 2. 


The quark masses quoted in Table 1 are those which emerge when quarks are probed at 
distances short compared with 1 fm, the characteristic size of strongly interacting particles 
and the scale at which QCD becomes too strong to utilise perturbation theory. When 
regarded as constituents of strongly interacting particles, however, the u and d quarks act 
as quasi-particles with masses of about 0.3GeV. The corresponding “constituent-quark” 
masses of s, c, and b are about 0.5, 1.5, and 4.9GeV, respectively. 
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Quarks Leptons 
Charge 2/3 Charge —1/3 | Charge —1 Charge 0 
Mass Mass Mass Mass 
u 0.001-0.005 | d 0.003-0.009 | e 0.000511 | ve <3eV 
c 1.15-1.35 s 0.075-0.175 | p. 0.106 v, «190 keV 
t 174345.1 |b 4.0-4.4 T 1.777 v, «18.2 MeV 


Table 1. The known quarks and leptons. Masses in GeV ezcept where indicated other- 


wise. Here and elsewhere we take c = 1. 


Relative Transition | Source of information 
amplitude (example) 

~l uod Nuclear -decay 

~ 1 cos Charmed particle decays 
~ 0.22 uc s Strange particle decays 
~ 0.22 cod Neutrino prod. of charm 
~ 0.04 cob b decays 

~ 0.003-0.004 utb Charmless b decays 

~l tob Dominance of t 2 Wb 
~ 0.04 tos Only indirect evidence 
~ 0.01 tod Only indirect evidence 


Table 2. Relative strengths of charge-changing weak transitions. 


1.2 Colour and quantum chromo-dynamics 


The quarks are distinguished from the leptons by possessing a three-fold charge known 
as “colour” which enables them to interact strongly with one another. (A gauged colour 
symmetry was first proposed by Nambu 1966.) We shall also speak of quark and lepton 
“flavour” when distinguishing the particles in Table 1 from one another. The experimental 
evidence for colour comes from several quarters. 


1. Quark statistics. One of the lowest-lying hadrons is a particle known as the Att, 
an excited state of the nucleon first produced in «tp collisions in the mid-1950s at the 
University of Chicago cyclotron. It can be represented in the quark model as uuu, so 
it is totally symmetric in flavour. It has spin J = 3/2, which is a totally symmetric 
combination of the three quark spins (each taken to be 1/2). Moreover, as a ground state, 
it is expected to contain no relative orbital angular momenta among the quarks. 


This leads to a paradox if there are no additional degrees of freedom. A state composed 
of fermions should be totally antisymmetric under the interchange of any two fermions, 
but what we have described so far is totally symmetric under flavour, spin, and space 


4 Jonathan L Rosner 


interchanges, hence totally symmetric under their product. Colour introduces an addi- 
tional degree of freedom under which the ingerchenge of two quarks can produce a minus 
sign, through the representation A** ~ e,4,u*u^u*. The totally antisymmetric product 
of three colour triplets is a colour singlet. 


2. Electron-positron annihilation to hadrons. The charges of all quarks which can be 
produced in pairs below a given centre-of-mass energy is measured by the ratio 


R 


g(e*e" — M 5 

ip ere EN A AT d 1 

c(ete- — pty = gi: (1) 

For energies at which only uü, dd, and s8 can be produced, i.e., below the charmed-pair 
threshold of about 3.7GeV, one expects 


2\? -1\? -1\? 2 
R=N,|(= — —]|22N, 
GO) «G9 «G9 3 (2) 
for N, “colours” of quarks. Measurements first performed at the Frascati laboratory in 
Italy and most recently at the Beijing Electron-Positron Collider (Bai et al. 2001: see 


Figure 2) indicate R = 2 in this energy range (with a small positive correction associated 
with the strong interactions of the quarks), indicating N, = 3. 


R Value 


BESII (1998) 
BESII (1999) 
Gamma2 
Markl 


5 
E,,, (GeV) 


Figure 2. Values of R measured by the BES Collaboration. 


3. Neutral pion decay. The a? decay rate is governed by a quark loop diagram in 
which two photons are radiated by the quarks in 7° = (uü — dd)/4/2. The predicted rate 
is 


S? 2 
ae i E (3) 
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where f, = 131MeV and S = N,(Q? — Q2) = N./3. The experimental rate is 7.8+0.6 eV, 
while (3) gives 7.65? eV, in accord with experiment if 5 = 1 and N, = 3. 


4. Triality. Quark composites appear only in multiplesof three. Baryons are composed 
of qqq, while mesons are qd (with total quark number zero). This is compatible with our 
current understanding of QCD, in which only colour-singlet states can appear in the 
spectrum. Thus, mesons M and baryons B are represented by 


1 1 c 
M = (g),  B--z(aqq d"). (4) 


v3 v6 


Direct. evidence for the quanta of QCD, the gluons, was first presented in 1979 on the 
basis of extra “jets” of particles produced in electron-positron annihilation to hadrons. 
Normally one sees two clusters of energy associated with the fragmentation of each quark 
in e*e^ — qd into hadrons. However, in some fraction of events an extra jet was seen, 
corresponding to the radiation of a gluon by one of the quarks. 


The transformations which take one colour of quark into another are those of the 
group SU(3). We shall often refer to this group as SU(3)colour to distinguish it from the 
SU(3) favour associated with the quarks u, d, and s. 


1.3 Electroweak unification 


The electromagnetic interaction is described in terms of photon exchange, for which the 
Born approximation leads to a matrix element behaving as 1/q?. Here q is the four- 
momentum transfer, and q? is its invariant square. The quantum electrodynamics of 
photons and charged point-like particles (such as electrons) initially encountered calcula- 
tional problems in the form of divergent quantities, but these had been tamed by the late 
1940s through the procedure known as renormalisation, leading to successful estimates of 
such quantities as the anomalous magnetic moment of the electron and the Lamb shift in 
hydrogen. 


By contrast, the weak interactions as formulated up to the mid-1960s involved the 
point-like interactions of two currents, with interaction Hamiltonian Hw = G rJ, SH J 2, 
where Gr = 1.16637(1) x 10-°GeV~? the current value for the Fermi coupling constant. 
This interaction is very singular and cannot be renormalised. The weak currents J, in 
this theory were purely charge-changing. As a result of work by Lee and Yang, Feynman 
and Gell-Mann, and Marshak and Sudarshan in 1956-7 they were identified as having 
(vector)-(axial) or “V—A” form. 

Hideki Yukawa (1935) and Oskar Klein (1938) proposed a boson-exchange model for 
the charge-changing weak interactions. Klein's model attempted a unification with elec- 
tromagnetism and was based on a local isotopic gauge symmetry, thus anticipating the 
theory of Yang and Mills (1954). Julian Schwinger and others studied such models in 
the 1950s, but Glashow (1961) was the first to realise that a new neutral heavy boson 
had to be introduced as well in order to successfully unify the weak and electromagnetic 
interactions. The breaking of the electroweak symmetry (Weinberg 1967, Salam 1968) 
via the Higgs (1964) mechanism converted this phenomenological theory into one which 
could be used for higher-order calculations, as was shown by 't Hooft and Veltman in the 
early 1970s. 
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The boson-exchange model for charge-changing interactions replaces the Fermi in- 
teraction constant with a coupling constant g at each vertex and the low-g? limit of a 
propagator, 1/(Miy — q?) > 1/Miy, with factors of 2 chosen so that Gp/ v2 = g? /8M$y. 
The q? term in the propagator helps the theory to be more convergent, but it is not the 
only ingredient needed, as we shall see. 


The normalisation of the charge-changing weak currents J, suggested well in advance 
of electroweak unification that one regard the corresponding integrals of their time com- 
ponents (the so-called weak charges) as members of an SU(2) algebra (Gell-Mann and 
Lévy 1960, Cabibbo 1963). However, the identification of the neutral member of this 
multiplet as the electric charge was problematic. In the V—A theory the W's couple 
only to left-handed fermions v, = (1 — y5)W/2, while the photon couples to v; + wp, 
where ig = (1 + y5)9/2. Furthermore, the high-energy behaviour of the vy —2 WtW- 
scattering amplitude based on charged lepton exchange leads to unacceptable divergences 
if we incorporate it into the one-loop contribution to vi — vv (Quigg 1983). 


A simple solution was to add a neutral boson Z coupling to WWT and vr in such 
a way as to cancel the leading high-energy behaviour of the charged-lepton-exchange 
diagram. This relation between couplings occurs naturally in a theory based on the gauge 
group SU(2) & U(1). The Z leads to neutral current interactions, in which (for example) 
an incident neutrino scatters inelastically on a hadronic target without changing its charge. 
The discovery of neutral-current interactions of neutrinos and many other manifestations 
of the Z proved to be striking confirmations of the new theory. 


Identifying the Wt and W7 with raising and lowering operations in an SU(2), so that 
W+ = (W'xiW?)V2, then left-handed fermions may be assigned to doublets of this “weak 
isospin”, with Igz(u.c, t) Di(we, v, v) = 41/2 and IPaj(ds.b)-Ig (e. 77) = —1/2. 
All the right-handed fermions have J, = I3, = 0. As mentioned, one cannot simply 
identify the photon with W?, which also couples only to left-handed fermions. Instead, 
one must introduce another boson B associated with a U(1) gauge group. It will mix 
with the W? to form physical states consisting of the massless photon A and the massive 
neutral boson Z: 


A= Bcosé+W'siné, Z = -Bsin6 + W? cos0. (5) 


The mixing angle @ appears in many electroweak processes. It has been measured to 
sufficiently great precision that one must specify the renormalisation scheme in which it 
is quoted. For present purposes we shall merely note that sin? 8 ~ 0.23. The correspond- 
ing SU(2) and U(1) coupling constants g and g' are related to the electric charge e by 


e = gsinÓ = g' cos 0, so that 
1 1 1 


The electroweak theory successfully predicted the masses of the W* and Z: 
Mw ~ 38.6 GeV /sin 0 ~ 80.5 GeV, Mz ~ Mw/cos@ ~ 91.2 GeV, (7) 


where we show the approximate experimental values. The detailed check of these predic- 
tions has reached the precision that one can begin to look into the deeper structure of 
the theory. A key ingredient in this structure is the Higgs boson, the price that had to be 
paid for the breaking of the electroweak symmetry. 
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1.4 Higgs boson 


An unbroken SU(2) & U(1) theory involving the photon would require all fields to have 
zero mass, whereas the W+ and Z are massive. The symmetry-breaking which generates 
W and Z masses must not destroy the renormalisability of the theory. However, a massive 
vector boson propagator is of the form 


-9w + q,9,/M* 
Dw(q) = “tae 


(8) 
where M is the boson mass. The terms q,g,, when appearing in loop diagrams, will 
destroy the renormalisability of the theory. They are associated with longitudinal vector 
boson polarisations, which are only present for massive bosons. For massless bosons like 
the photon, there are only transverse polarisation states J, = +J. 


The Higgs mechanism, to be discussed in detail later in these lectures, provides the 
degrees of freedom needed to add a longitudinal polarisation state for each of Wt, W7, 
and W?. In the simplest model, this is achieved by introducing a doublet of complex 


Higgs fields: 
+ P 
ZIMIE [2]. (9) 


Here the charged Higgs fields 6+ provide the longitudinal component of WË and the linear 
combination (¢°— 99)/iv/2 provides the longitudinal component of the Z. The additional 
degree of freedom (¢° + ¢°)/V2 corresponds to a physical particle, the Higgs particle. 
which is the subject of intense searches. 


Discovering the nature of the Higgs boson is a key to further progress in understanding 
what may lie beyond the Standard Model. There may exist one Higgs boson or more than 
one. There may exist other particles in the spectrum related to it. The Higgs boson may 
be elementary or composite. If composite, it points to a new level of substructure of the 
elementary particles. Much of our discussion will lead up to strategies for the next few 
years designed to address these questions. First, we introduce the necessary topic of gauge 
theories, which have been the platform for all the developments of the past thirty years. 


2 Gauge theories 


2.1 Abelian gauge theories 


The Lagrangian describing a free fermion of mass m is Liree = v d—m)v. It is invariant 
under the global phase change  — exp(ia)w. (We shall always consider the fermion fields 
to depend on z.) Now consider independent phase changes at each point: 


Vy > y = exp[ia(z)]v. (10) 


Because of the derivative, the Lagrangian then acquires an additional phase change at 
each point: ó£g,, = vi," [i0,a(x)|v. The free Lagrangian is not invariant under such 
changes of phase, known as local gauge transformations. 
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Local gauge invariance can be restored by the replacement 0, > D, = 0, + ieA, in 
the free-fermion Lagrangian, which now becomes 


Vi p-m)) = $i 8 mv — ev A(z). (11) 


The effect of a local phase in can be compensated if we allow the vector potential A, 
to change by a total divergence, which does not change the electromagnetic field strength 
(defined as in Peskin and Schroeder 1995; Quigg 1983 uses the opposite sign) 


Fy, = 0,A, — 0, Ay. (12) 


Indeed, under the transformation Yy — y’ and with A — A’ with A’ yet to be determined, 
we have 


= y(i8- mw -e Kv = Yli P—m)b— ylfa —epA'y. (13) 
This will be the same as £ if 
Ai, (x) = A, (x) - Taala) . (14) 


The derivative D, is known as the covariant derivative. One can check that under a local 
gauge transformation, D, — e'*(? D), y. 

Another way to see the consequences of local gauge invariance suggested by Yang 
(1974) and discussed by Peskin and Schroeder (1995, pp 482-486) is to define —eA,(x) 
as the local change in phase undergone by a particle of charge e as it passes along an 
infinitesimal space-time increment between z^ and z^ + dz". For a space-time trip from 
point A to point B, the phase change is then 


Ag = exp (-« I, ° Agle)de") (15) 


The phase in general will depend on the path in space-time taken from point A to point B. 
As a consequence, the phase $45 is not uniquely defined. However, one can compare the 
result of a space-time ap along one path, leading to a phase e. with that along another, 
leading to a phase peL. The two-slit experiment in quantum mechanics involves such a 
comparison; so does the Bohm-Aharonov effect in which a particle beam traveling past 
a solenoid on one side interferes with a beam traveling on the other side. Thus, phase 
differences 


of ar — = Oo = exp (- ie f Ayla («)de") ; (16) 


associated with closed paths in space-time (represented by the circle around the integral 
sign), are the ones which correspond to physical experiments. The phase $c for a closed 
path C is independent of the phase convention for a charged particle at any space-time 
point rg, since any change in the contribution to ®¢ from the integral up to rg will be 
compensated by an equal and opposite contribution from the integral departing from xg. 


The closed path integral (16) can be expressed as a surface integral using Stokes' 
theorem: 


f Ade) da^ = f Ful rdo” , (17) 
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where the electromagnetic field strength F,, was defined previously and do*" is an element 
of surface area. It is also clear that the closed path integral is invariant under changes (14) 
of A,(r) by a total divergence. Thus F,, suffices to describe all physical experiments as 
long as one integrates over a suitable domain. In the Bohm-Aharonov effect, in which a 
charged particle passes on either side of a solenoid, the surface integral will include the 
solenoid (in which the magnetic field is non-zero). 


If one wishes to describe the energy and momentum of free electromagnetic fields, one 
must include a kinetic term Lg = —(1/4)F,, F"" in the Lagrangian. which now reads 


1 _ — 
L= -Fu F” + Bi B- my — eb Ay. (18) 


If the electromagnetic current is defined as J = OY, then this Lagrangian leads to 
Maxwell’s equations. 


The local phase changes (10) form a U(1) group of transformations. Since such trans- 
formations commute with one another, the group is said to be Abelian. Electrodynamics, 
just constructed here, is an example of an Abelian gauge theory. 


2.2 Non-Abelian gauge theories 


One can imagine that a particle traveling in space-time undergoes not only phase changes, 
but also changes of identity. Such transformations were first considered by Yang and Mills 
(1954). For example, a quark can change in colour (red to blue) or flavour (u to d). In 
that case we replace the coefficient eA, of the infinitesimal displacement dz, by an n x n 
matrix -gA, (x) = —gA, (z)T; acting in the n-dimensional space of the particle's degrees 
of freedom. (The sign change follows the convention of Peskin and Schroeder 1995.) 
For colours, n — 3. The T; form a linearly independent basis set of matrices for such 
transformations, while the Ai are their coefficients. The phase transformation then must 
take account of the fact that the matrices A,(r) in general do not commute with one 
another for different space-time points, so that a path-ordering is needed: 


@4,=P E (i f. 52] (19) 


When the basis matrices T; do not commute with one another, the theory is non-Abelian. 


We demand that changes in phase or identity conserve probability, i.e., that $45 be 
unitary: l oap — 1. When 4g is a matrix, the corresponding matrices A, (x) in (19) 
must be Hermitian. If we wish to separate out pure phase changes, in which A,(z) is a 
multiple of the unit matrix, from the remaining transformations, one may consider only 
transformations such that det(9 45) = 1, corresponding to traceless A, (x). 


The n x n basis matrices T; must then be Hermitian and traceless. There will be 

n? — ] of them, corresponding to the number of independent SU(N) generators. (One 

can generalise this approach to other invariance groups.) The matrices will satisfy the 
commutation relations 

IT, Tj] = ie; Tk, (20) 

where the cj, are structure constants characterising the group. For SU(2), cij = €:j% (the 

Kronecker symbol), while for SU(3), cije = fijk, where the fij, are defined in Gell-Mann 
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and Ne'eman (1964). A 3 x 3 representation in SU(3) is T; = A;/2, where A;/2 are the 
Gell-Mann matrices normalised such that Tr A;A; = 26;;. For this representation, then, 
Tr T;T; = 1/2. 

In order to define the field-strength tensor F,, = FLT; for a non-Abelian transfor- 
mation, we may consider an infinitesimal closed-path transformation analogous to (16) 


for the case in which the matrices A,(x) do not commute with one another. The result 
(see, e.g., Peskin and Schroeder 1995, pp 486-491) is 


F,, = 0,A, — 0,A, — ig[A,, Av], Fi, = 0,4, — OA}, + gei AL AL (21) 

An alternative way to introduce non-Abelian gauge fields is to demand that, by analogy 
with (10), a theory involving fermions 7 be invariant under local transformations 

v(r) > v(z)—U(z)i(z), | UtU-1, (22) 


where for simplicity we consider unitary transformations. Under this replacement we find 
that £L — £', where 


£zg(if-mw = WU —m)Uv 
di 9 m) + ivU y" (QU). (23) 
As in the Abelian case, an extra term is generated by the loca] transformation. It can be 
compensated by replacing 0, by 

ð, > D, = 0, — igA, (x). (24) 
In this case £ = v(i P — m) and under the change (22) we find 


7 


Cap(-mw = oU" (ig K —mUxs 
= L+V[g(U' KU- K) - iU (BU). (25) 
This is equal to £ if we take 
A! =UA,U7! — (Ove . (26) 
This reduces to our previous expressions if g = —e and U =e“), 


The covariant derivative acting on y transforms in the same way as w itself under 
a gauge transformation: D, => Div’ = UD,v. The field strength F,, transforms 
as F,, > Fi, = UF,,U~'. It may be computed via [D,,D,] = —igF,,; both sides 


"E 
transform as U( U~! under a local gauge transformation. 


In order to obtain propagating gauge fields, as in electrodynamics, one must add 
a kinetic term £y = —(1/4)F7, F""" to the Lagrangian. Recalling the representation 
E, = Fi, in terms of gauge group generators normalised such that Tr(T;T;) = 6;;/2, we 
can write the full Yang-Mills Lagrangian for gauge fields interacting with matter fields as 


L=- VIF, F^) + VG D - my. (27) 


We shall use Lagrangians of this type to derive the strong, weak, and electromagnetic 
interactions of the “Standard Model." 
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The interaction of a gauge field with fermions then corresponds to a term in the 
interaction Lagrangian AL = gy(z) A,(z)v(z). The [A,,A,] term in F,, leads to 
self-interactions of non-Abelian gauge fields, arising solely from the kinetic term. Thus, 
one has three- and four-field vertices arising from 


Ace) = (0,A;)gcij AY Ave ALY = E itin AP AMAT AY . (28) 


These self-interactions are an important aspect of non-Abelian gauge theories and are 
responsible in particular for the remarkable asymptotic freedom of QCD which leads to its 
becoming weaker at short distances, permitting the application of perturbation theory. 


2.3 Elementary divergent quantities 


In most quantum field theories, including quantum electrodynamics, divergences occurring 
in higher orders of perturbation theory must be removed using charge, mass, and wave 
function renormalisation. This is conventionally done at intermediate calculational stages 
by introducing a cutoff momentum scale A or analytically continuing the number of space- 
time dimensions away from four. Thus, a vacuum polarisation graph in QED associated 
with external photon momentum k and a fermion loop will involve an integral 


d'p 1 1 . 
T) ~ f T (= "CE "D : (29) 


a self-energy of a fermion with external momentum p will involve 


dq 1 1 
Si ~ | Gaia ge goa (30) 
and a fermion-photon vertex function with external fermion momenta p, »' will involve 
d'k 1 1 1 
A.U! p) e f , " 
„(p'a p) (Oni Wt fom "pe bom’ (31) 


The integral (29) appears to be quadratically divergent. However, the gauge invariance of 
the theory translates into the requirement k“II,,,, = 0, which requires II, to have the form 


IL, (k) = (Qu — k,k,)H(K?). (32) 


The corresponding integral for II(&?) then will be only logarithmically divergent. The inte- 
gral in (30) is superficially linearly divergent but in fact its divergence is only logarithmic. 
as is the integral in (31). 

Unrenormalised functions describing vertices and self-energies involving ng external 
boson lines and nr external fermion lines may be defined in terms of a momentum cutoff 
A and a bare coupling constant go (Coleman 1971, Ellis 1977, Ross 1978): 

DU ue = TE n ri go. ^). (33) 


ng.nrg ng.np 


where p; denote external momenta. Renormalised functions T? may be defined in ternis of 
a scale parameter 44, a renormalised coupling constant g = g(go. A/p), and renormalisation 
constants Zg(A) and Zr(A) for the external boson and fermion wave functions: 


Tř (pign) = lim [ZA] [Ze AJ] Tr, n (Pigo A) (34) 
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The scale p is typically utilised by demanding that T? be equal to some predetermined 


function at a Euclidean momentum p? = —p?. Thus, for the one-boson, two-fermion 
vertex, we take 
R : = 
Ty2(0,P,—P)| oj» = lim ZrZgVro(0.p,—p)|, | , = 9: (35) 


The unrenormalised function T" is independent of p, while TË and the renormalisation 
constants Zg(A), Zr(A) will depend on u. For example, in QED, the photon wave 
function renormalisation constant (known as Z3) behaves as 

ag A? 
Z4-1-ln—. 
3 3, In rr (36) 
The bare charge ey and renormalised charge e are related by e = ey Zyl”. To lowest 
order in perturbation theory, e « eg. The vacuum behaves as a normal dielectric; charge 
is screened. It is the exception rather than the rule that in QED one can define the 
renormalised charge for q? = 0; in QCD we shall see that this is not possible. 


2.4 Scale changes and the beta function 


We differentiate both sides of (34) with respect to j and multiply by ui. Since the functions 
I" are independent of p, we find 


8 , Og On _ [nB ôZe ne ÔZF\ ong gue pu 
Gara (pig. p) = lim (zs ou +z” ot) zs ZI". (37) 
or 8 2 
la. t MUT T npgys(g) + ew TA(p;, 9, i) — 0, (38) 
where 2 " » 
Pl) = uar ya(g) = Zp ðu ` yr(9) = Ze ön ` (39) 


The behaviour of any generalised vertex function TË under a change of scale p is then 
governed by the universal functions (39). 


Here we shall be particularly concerned with the function B(g). Let us imagine p — Ap 
and introduce the variables t = In À, 9(g, t) = g(go, A/Az), Then the relation for the beta- 
function may be written 


499.0 = BG) ` g(g. 0) = 9(go. A/H) =g. (40) 


Let us compare the behaviour of g with increasing ¢ (larger momentum scales or shorter 
distance scales) depending on the sign of (3). In general we will find 8(0) = 0. We take 
B(g) to have zeroes at g = 0, gi, go, .... Then: 


1. Suppose B(g) > 0. Then g increases from its t = 0 value g = g until a zero gi of 
B(g) is encountered. Then g — g; as t — oc. 


2. Suppose B(g) < 0. Then 9 decreases from its t = 0 value g = g until a zero g; of 
B(G) is encountered. 
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In either case § approaches a point at which 8(g) = 0, 8'(g) < 0 as t — oco. Such 
points are called ultraviolet fired points. Similarly, points for which 3(g) = 0, 8’(g) > 0 
are infrared fixed points, and g will tend to them for t ^ —oo (small momenta or large 
distances). The point e = 0 is an infrared fixed point for quantum electrodynamics, since 
B'(e) >Date — 0. 

It may happen that 8'(0) < 0 for specific theories. In that case g = 0 is an ultraviolet 
fixed point, and the theory is said to be asymptotically free. We shall see that this property 
is particular to non-Abelian gauge theories (Gross and Wilczek 1973, Politzer 1974). 


2.5 Beta function calculation 


In quantum electrodynamics a loop diagram involving a fermion of unit charge contributes 
the following expression to the relation between the bare charge eg and the renormalised 


charge e: 
ag, A 
e=a(1- n1). (41) 
as implied by (35) and (36), where ag = e2/47. We find 
e e € 
“129? ~ 123?" 


(42) 


where differences between eg and e correspond to higher-order terms in e. Thus f(e) > 0 
for small e and the coupling constant becomes stronger at larger momentum scales (shorter 
distances). 


We shall show an extremely simple way to calculate (42) and the corresponding result 
for a charged scalar particle in a loop. From this we shall be able to first calculate the 
effect. of a charged vector particle in a loop (a calculation first performed by Khriplovich 
1969) and then generalise the result to Yang-Mills fields. The method follows that of 
Hughes (1980). 


When oné takes account of vacuum polarisation, the electromagnetic interaction in 
momentum space may be written 


e? e? 


$C g?[l + H(g?)] 


Here the long-distance (q?-50) behaviour has been defined such that e is the charge 
measured at macroscopic distances, so II(0) = 0. Following Sakurai (1967), we shall 
reconstruct IL;(g?) for a loop involving the fermion species i from its imaginary part, 
which is measurable through the cross section for e*e^ — ii: 


(43) 


Im IL(s) = ——e(e*e- ii), (44) 


where s is the square of the centre-of-mass energy and a = e?/4z. For fermions f of 
charge e; and mass my, 


2 2m 4m2\ ? 
Im II(s) = = ( + 223 ( - zu 6(s — Am?), (45) 
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while for scalar particles of charge e, and mass m;, 


2 4 2 3/2 
Im IL(s) = 75 (.- ud 8(s — 4m). (46) 


The corresponding cross section for ete” — pty, neglecting the muon mass, is 
a(ete” — u* p^) = 4no? /3s, so one can define 
R; = a(ete” > ii)fo(e'e a u*y ), (47) 


in terms of which Im IL(s) = oR;(s)/3. For s > oo one has Rj(s) — e$ for a fermion 
and R,(s) > e?/4 for a scalar. 


The full vacuum polarisation function II;(s) cannot directly be reconstructed in terms 
of its imaginary part via the dispersion relation 


I(s) = Zu d tm Hs), (48) 
4m? 8'-- S 

since the integral is logarithmically divergent. This divergence is exactly that encountered 
earlier in the discussion of renormalisation. For quantum electrodynamics we could deal 
with it by defining the charge at q? = 0 and hence taking IL(0) = 0. The once-subtracted 
dispersion relation for I; (s) — Il;(0) would then converge: 


IL(s) = zl. — d ImIL(). (49) 


m? S'(s' — s) 


However, in order to be able to consider cases such as Yang-Mills fields in which the 
theory is not well-behaved at q? = 0, let us instead define IL;(—4?) = 0 at some spacelike 


scale q? = —4?. The dispersion relation is then 
1 1 1 
IL(s) = f, ds E — ZA al Im IL(s). (50) 
For || >> p? >> m?, we find 
a -¢ 
IL(g) 2 -—Ri(oo) L — + const. ; (51) 
3r H 


and so, from (43), the “charge at scale g” may be written as 


e? . a -g 
The beta-function here is defined by &(e) = u(Oe/Op) at fixed e,. This expression gives 
B(e) = —fee?/ (162?) + O(e*) and one finds By = -(4/3)e? for spin-1/2 fermions and 
Bo = —(1/3)e? for scalars. 


These results will now be used to find the value of 5o for a single charged massless vector 
field. We generalise the results for spin 0 and 1/2 to higher spins by splitting contributions 
to vacuum polarisation into “convective” and “magnetic” ones. Furthermore, we take into 
account the fact that a closed fermion loop corresponds to an extra minus sign in IIy(s) 
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(which is already included in our result for spin 1/2). The “magnetic” contribution of 
a particle with spin projection S, must be proportional to S?. For a massless spin-S 
particle, S? = S?. We may then write 


Bo -f (—1)"* (aS? +0)(S =0), 


(-1y'*(aS? + 26)(S £0), (53) 


where np = 1 for a fermion, 0 for a boson. The factor of 2b for S # 0 comes from the 
contribution of each polarisation state (S, = +5) to the convective term. Matching the 
results for spins 0 and 1/2, 


1 4 a 
bla, -12 (242), | 
3 3 ($+ (54) 
we find a = 8 and hence for S = 1 
2 22 


The magnetic contribution is by far the dominant one (by a factor of 12), and is of 
opposite sign to the convective one. A similar separation of contributions, though with 
different interpretations, was obtained in the original calculation of Khriplovich (1969). 
The reversal of sign with respect to the scalar and spin-1/2 results is notable. 


2.6 Group-theoretic techniques 


The result (55) for a charged, massless vector field interacting with the photon is also the 
value of 89 for the Yang-Mills group SO(3) ~ SU(2) if we identify the photon with As 
and the charged vector particles with AX = ( A} F iA?)/ V2. We now generalise it to the 
contribution of gauge fields in an arbitrary group G. 

The value of 3o gauge fields depends on a sum over all possible self-interacting gauge 
fields that can contribute to the loop with external gauge field labels i and m: 


AG]... acre (56) 
BSTC] 7 SESE 


where ce, is the structure constant for G, introduced in (20). The sums in (56) are 
proportional to dim: 


CijkCmjk = dimC2(A) t (57) 


The quantity C3(A) is the quadratic Casimir operator for the adjoint representation of 
the group G. 


Since the structure constants for SO(3) ~ SU(2) are just Gu = eig, one finds 


C2(A) = 2 for SU(2), so the generalisation of (55) is that Bo gauge fields = (11/3)C2(A). 
The contributions of arbitrary scalars and spin-1/2 fermions in representations R are 
proportional to T(R), where 
Tr (LT;) = &T() (58) 


for matrices T; in the representation R. For a single charged scalar particle (e.g., a 
pion) or fermion (e.g. an electron), T(R) = 1. Thus £o spino = —(1/3)To0(R), while 
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Bo spin 1/2 = —(4/3)T y (R), where the subscript on T(R) denotes the spin. Summarising 
the contributions of gauge bosons, spin 1/2 fermions, and scalars, we find 


Bo = SCA) - 3 VST als) - Vo s (Re) ) (59) 
f Li 


One often needs the beta-function to higher orders, notably in QCD where the per- 
turbative expansion coefficient is not particularly small. It is 


5 


- gp J 
Ba) = -Po Ter? A neut , (60) 


where the result for gauge bosons and spin 1/2 fermions (Caswell 1974) is 
2 
By = 3 UTI (CP - 10T(R)C2(4) - 6T(R)C:(R)} . (61) 


The first term involves loops exclusively of gauge bosons. The second involves single- 
gauge-boson loops with a fermion loop on one of the gauge boson lines. The third involves 
fermion loops with a fermion self-energy due to a gauge boson. The quantity C2(R) is 
defined such that 

[T'(R)T*( Blas = Co (Ros , (62) 


where a and f are indices in the fermion representation. 


We now illustrate the calculation of C2(A), T(R), and Co(R) for SU(N). More general 
techniques are given by Slansky (1981). 


Any SU(N) group contains an SU(2) subgroup, which we may take to be generated 
by Ti, To, and T3. The isospin projection 7; may be identified with 73. Then the J3 value 
carried by each generator T; (written for convenience in the fundamental N-dimensional 
representation) may be identified as shown below: 


2 —-N-22 

0 1 1/2 --- 1/2 
-1 0 |-1/2 --- —1/2 
—1/2 1/2 0 ee 0 
—1/2 1/2 0 e 0 


Since C,(A) may be calculated for any convenient value of the index 7 = m in (57). 
we chose i = m = 3. Then 


1 2 
CY(A)— Y (I)? 2 14-14-4(N - 2) (5) =N. (63) 


adjoint 


As an example, the octet (adjoint) representation of SU (3) has two members with |73| = 1 
(e.g., the charged pions) and four with |Z3| = 1/2 (e.g.. the kaons). 
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For members of the fundamental representation of SU(N), there will be one member 
with J; = 41/2, another with J; = —1/2, and all the rest with J; = 0. Then again 
choosing i = m = 3 in (58), we find T(R)|tundamental = 1/2. The SU(N) result for 8) in the 
presence of n; spin 1/2 fermions and n, scalars in the fundamental representation then 


may be written 9 
11 1 

—2LN-Ín,-2 

foy N- 3n 5 

The quantity C2(A) in (63) is most easily calculated by averaging over all indices 

a = fl. If all generators T’ are normalised in the same way, one may calculate the result 
for an individual generator (say, T3) and then multiply by the number of generators [N? — 1 


for SU(N)]. For the fundamental representation one then finds 
1 1? 1] N?-1 
UN n? 7 (5) (-5) = ` 
Co(R) = wl D| 2) * V3 2N (65) 


2.7 The running coupling constant 


n,. (64) 


One may integrate (60) to obtain the coupling constant as a function of momentum scale 
M and a scale-setting parameter A. In terms of à = g’/47, one has 


dà a? a? M? 
= — t=2= — ]. 
dr 7 Am P gg f=In ( A? (66) 
For large t the result can be written as 
- 4m By Int’ _ 
M?) = —— |1- 5—-| +01’). 7 
am’) = 5" | a + OE) (67) 


Suppose a process involves p powers of & to leading order and a correction of order 
pt. 
apr’: 


T = Aa’[1 + Ba + O(a’). (68) 
If A is rescaled to AA, then t 3 t' —21nA = t'(1 — 21n A/t^), so 
o? — à ( + taima) . (69) 
T 


The coefficient B thus depends on the scale parameter used to define a. 


Many prescriptions have been adopted for defining A. In one ('t Hooft 1973), the 
*minimal subtraction" or MS scheme, ultraviolet logarithmic divergences are parame- 
terised by continuing the space-time dimension d = 4 to d = 4 — e and subtracting 
pole terms f d*7*/p* ~ 1/e. In another (Bardeen et al. 1978) (the “modified minimal 
subtraction" or MS scheme) a term 


1 1 m4r- 
= -4 (70) 


€ € 2 


containing additional finite pieces is subtracted. Here yg = 0.5772 is Euler's constant, and 
one can show that Ayg = Ams exp[(In 4a — ye)/2]. Many O(a) corrections are quoted in 
the MS scheme. Specification of A in any scheme is equivalent to specification of a(M?). 
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Figure 3. Scale-dependence of the strong-coupling constant às(M?) subject to the con- 
straint àg( M2) = 0.118 + 0.002. The solid line shows the central value; dashed lines 
indicate 10 limits. 


2.8 Applications to quantum chromodynamics 


A “golden application” of the running coupling constant to QCD is the effect of gluon 
radiation on the value of R in e*e^ annihilations. Since R is related to the imaginary part 
of the photon vacuum polarisation function II(s) which we have calculated for fermions 
and scalar particles, one calculates the effects of gluon radiation by calculating the correc- 
tion to II(s) due to internal gluon lines. The leading-order result for colour-triplet quarks 
is R(s) 2 R(s)[1 + àa(s)/»]. There are many values of s at which one can measure such 
effects. For example, at the mass of the Z, the partial decay rate of the Z to hadrons 
involves the same correction, and leads to the estimate &s(M2) = 0.118 + 0.002. The 
dependence of &.5(M?) satisfying this constraint on M? is shown in Figure 3. As we shall 
see in Section 5.1, the electromagnetic coupling constant also runs, but much more slowly, 
with a^! changing from 137.036 at q? = 0 to about 129 at q? = M2. 


A system which illustrates both perturbative and non-perturbative aspects of QCD is 
the bound state of a heavy quark and a heavy antiquark, known as quarkonium (in analogy 
with positronium, the bound state of a positron and an electron). We show in Figures 4 
and 5 the spectrum of the cé and bb bound states (Rosner 1997). The charmonium (cc) 
system was an early laboratory of QCD (Appelquist and Politzer 1975). 


The S-wave (L = 0) levels have total angular momentum J, parity P. and charge- 
conjugation eigenvalue C equal to JP? = 0*^ and 177 as one would expect for ! Sy and 4S; 
states, respectively, of a quark and antiquark. The P-wave (L = 1) levels have JPC = 1*- 
for the ! P,, 0** for the P5, 1** for the * Pj, and 2** for the? P. The JPC = 177 levels 
are identified as such by their copious production through single virtual photons in e*e^ 
annihilations. The 0 * level 7, is produced via single-photon emission from the J/y (so its 
C is positive) and has been directly measured to have J? compatible with 0^. Numerous 
studies have been made of the electromagnetic (electric dipole) transitions between the 
S-wave and P-wave levels and they, too, support the assignments shown. 
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Figure 4. Charmonium (cc) spectrum. Observed and predicted levels are denoted by 
solid and dashed horizontal lines, respectively. Arrows denote electromagnetic transitions 
(labeled by y) and hadronic transitions (labeled by emitted hadrons). 
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Figure 5. Spectrum of bb states. Observed and predicted levels are denoted by solid 
and dashed horizontal lines, respectively. In addition to the transitions labeled by arrows, 
numerous electric dipole transitions and decays of states below BB threshold to hadrons 
containing light quarks have been seen. 
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Figure 6. Spectrum of lowest-lying states containing one charmed and one light quark. 
Observed and predicted levels are denoted by solid and broken horizontal lines, respectively. 
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Figure 7. Spectrum of lowest-lying states containing one bottom and one light quark. 
Observed and predicted levels are denoted by solid and broken horizontal lines, respectively. 
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The 66 and cé levels have a very similar structure, aside from an overall shift. The 
similarity of the cé and bb spectra is in fact an accident of the fact that for the interquark 
distances in question (roughly 0.2 to 1 fm), the interquark potential interpolates between 
short-distance Coulomb-like and long-distance linear behavior. The Coulomb-like behav- 
ior is what one would expect from single-gluon exchange, while the linear behavior is a 
particular feature of non-perturbative QCD which follows from Gauss' law if chroino- 
electric flux lines are confined to a fixed area between two widely separated sources 
(Nambu 1974). It has been explicitly demonstrated by putting QCD on a space-time 
lattice, which permits it to be solved numerically in the non-perturbative regime. 


States consisting of a single charmed quark and light (u, d, or s) quarks or antiquarks 
are shown in Figure 6. Finally, the pattern of states containing a single b quark (Figure 7) 
is very similar to that for singly-charmed states, though not as well fleshed-out. In many 
cases the splittings between states containing a single b quark is less than that between 
the corresponding charmed states by roughly a factor of m/m, ~ 1/3 as a result of the 
smaller chromomagnetic moment of the b quark. Pioneering work in understanding the 
spectra of such states using QCD was done by De Rüjula ef al. (1975), building on earlier 
observations on light-quark systems by Zel'dovich and Sakharov (1966), Dalitz (1967), 
and Lipkin (1973). 


3 W bosons 


3.1 Fermi theory of weak interactions 


The effective four-fermion Hamiltonian for the V—A theory of the weak interactions is 


Gp i - | Gre _ - 
Hw = Vi [o m *)u»] [o^ (1 - 75)U4| = B (Fire) (Vaio vun) . (71) 
where G and wz, were defined in Section 1.3. We wish to write instead a Lagrangian 
for interaction of particles with charged W bosons which reproduces (71) when taken to 
second order at low momentum transfer. We shall anticipate a result of Section 4 by 
introducing the W through an SU(2) symmetry, in the form of a gauge coupling. 


In the kinetic term in the Lagrangian for fermions, 
Lrs = Yli p- m) = Yili Pdi + Vali Pin — mv, (72) 


the Ø term does not mix V; and vg, so in the absence of the Yy term one would have the 
freedom to introduce different covariant derivatives D acting on left-handed and right- 
handed fermions. We shall find that the same mechanism which allows us to give masses 
to the W and Z while keeping the photon massless will permit the generation of fermion 
masses even though v; and vg will transform differently under our gauge group. We 
follow the conventions of Peskin and Schroeder (1995, p 700 ff). 


We now let the left-handed spinors be doublets of an SU(2), such as 
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(We will postpone the question of neutrino mixing until the last section.) The W is 
introduced via the replacement 


0, > D, = 0, — ig TW}, Tis r/2, (74) 


where 7? are the Pauli matrices and Wi are a triplet of massive vector bosons. Here we 
will be concerned only with the W+, defined by Ww = (Wi T iW2)/ y. The field Wt 
annihilates a W * and creates a W7, while W; annihilates a W^ and creates a W+. Then 


W} = (Wt - W,)/V2 and W? = i(W,t — W7 )/ V2, so 


3 + 
TW = ; | Wi E vw, | . (75) 
V2W- -W} 
The interaction arising from (72) for a lepton | = e, p, T is then 
£x) = 3 lu Woh ele wn] (76) 
int, l V2 LY Wyte E FLY Wy, Min] , 


where we temporarily neglect the W? terms. Taking this interaction to second order 
and replacing the W propagator (Mi, — q?)-! by its q? = 0 value, we find an effective 
interaction of the form (71), with 
Gr_ g (77) 
JZ T 8MEC TT 
3.2 Charged-current quark interactions 


The left-handed quark doublets may be written 


, ; ; ; 7 
d' L s, b L 


where d', s', and b are related to the mass eigenstates d, s, b by a unitary transformation 


d d 
s|-2V|s|, VV=l1. (79) 
b b 


The rationale for the unitary matrix V of Kobayashi and Maskawa (1973) will be reviewed 
in the next section when we discuss the origin of fermion masses in the electroweak theory. 
The interaction Lagrangian for W's with quarks then is 


u d 


* 9 ,p5 ; 
Lim, quarks = SUL W; V Di) he, Uz|ic|, Ds|s|. (80) 
t 
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A convenient parameterisation of V (conventionally known as the Cabibbo-Kobayashi- 
Maskawa matrix, or CKM matrix) suggested by Wolfenstein (1983) is 


2 
Via Vas Va 1-4 d — AM(p— in) 
VE Va V. Vo | = -A 1-4 A» |e (80 
Va Vis Vis AM(1—p-—in) —AX 1 


Experimentally A ~ 0.22 and A œ 0.85. Present constraints on the parameters p and 7 
are shown in Figure 8. The solid circles denote limits on |Vis/|Ves| = 0.090 + 0.025 from 


0.6 


0.4 


0.2 


ur 
IN rw 
| L 1 gd Oy — L1. L 
—0.50 —0.25 0.00 0.25 0.50 
p 


Figure 8. Constraints on parameters of the Cabibbo- Kobayashi- Maskawa (CKM) matriz. 
The plotted point at p = 0.21, 7 = 0.38 lies in the middle of the allowed region. (See text.) 


charmless b decays. The dashed arcs are associated with limits on Vig from BB mixing. 
The present lower limit on B,-B, mixing leads to a lower bound on |Vi,/Via| and the dot- 
dashed arc. The dotted hyperbolae arise from limits on CP-violating Ko mixing. The 
phases in the CKM matrix associated with 7 Æ 0 lead to CP violation in neutral kaon 
decays (Christenson et al. 1964) and, as recently discovered, in neutral B meson decays 
(Aubert ef al. 2001a, Abe et al. 2001). These last results lead to a result shown by the 
two rays, sin(28) = 0.79 + 0.10, where 8 = Arg( VV; / Vi Vj; ). The small dashed lines 
represent lo limits derived by Gronau and Rosner (2002) (see also Luo and Rosner 2001) 
on the basis of CP asymmetry data of Aubert et al. (2001b) for B? — x* z^. Our range 
of parameters (confined by 1ø limits) is 0.10 € p < 0.32, 0.33 € 7 € 0.43. Similar plots 
are presented in several other lectures at this Summer School (see, e.g., Buchalla 2001, 
Nir 2001, Schubert 2001, Stone 2001), which may be consulted for further details, and an 
ongoing analysis of CKM parameters by Hocker et al. (2001) is now incorporating several 
other pieces of data. 
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3.3 Decays of the r lepton 


The 7 lepton (Perl et al. 1975) provides a good example of “standard model" charged- 
current physics. The r^ decays to a v, and a virtual W~ which can then materialise into 
any kinematically allowed final state: e~v,, p^ v,, or three colours of td’, where, in accord 


with (81), d œ 0.975d + 0.225. 


Neglecting strong interaction corrections and final fermion masses, the rate for r decay 
is expected to be 
TI(r- > all) = 5G? m 2x 107 eV (82) 
| SF 19273 — , 
corresponding to a lifetime of 7, ~ 3 x 107? s as observed. The factor of 5 — 1 -- 14-3 
corresponds to equal rates into e 4, 5 7,, and each of the three colours of üd'. The 
branching ratios are predicted to be 


Bir” > vre De) = Bir” > ph) = lg- > v üd’) = 2096. (83 
di 3 


Measured values for the purely leptonic branching ratios are slightly under 18%, as a 
result of the enhancement of the hadronic channels by a QCD correction whose leading- 
order behavior is 1 + ag/z, the same as for R in e*e^ annihilation. The 7 decay is thus 
further evidence for the existence of three colours of quarks. 


3.4 W decays 


We shall calculate the rate for the process W — f f' and then generalise the result to 
obtain the total W decay rate. The interaction Lagrangian (76) implies that the covariant 
matrix element for the process W (k) > f(p)f'(p) is 


M® = spurt — *s)vp (p) (4). (84) 


Here A describes the polarisation state of the W. The partial width is 


_ = 1 1 p 
Ww ) nl L ` (3)]2 
r( ff) 2Mw 3 pols we An Mw ` (85) 


where (2My)7! is the initial-state normalisation, 1/3 corresponds to an average of W 
polarisations, the sum is over both W and lepton polarisations, and p* is the final centre- 
of-mass (c.m.) 3-momentum. We use the identity 


ka ky 
3 4 (ke (k) = —9,, + 45 (86) 
a My 


for sums over W polarisation states. The result is that 


1 
L MOP = 9? Mi, — 5 (m? +m’) - (87) 


pols 
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for any process W — ff’, where m is the mass of f and m is the mass of f'. Recalling 
the relation between Gr and g’, this may be written in the simpler form 


Gr M3 2p* p? + 3EE' 


Mw My 
Here E = (p°? + m?)!/2 and E” = (p'? +m’)! are the c.m. energies of f and f’. The 
factor By reduces to 1 as m, m’ — 0. 


The present experimental average for the W mass (Kim 2001) is Mw = 80.451 + 
0.033GeV. Using this value, we predict [(W — e~e) = 227.8 + 2.3MeV. The widths to 
various channels are expected to be in the ratios 


rw > fF) = 


M2 (M2 
eD: Üp: T VQ: tid’: 6s —1:1: 1: ip ORAT. 3 [r+ SORT , (89) 
T T 


so aş( Mi) = 0.120 + 0.002 leads to the prediction Pio.(W) = 2.10 + 0.02GeV. This is 
to be compared with a value (Drees 2001) obtained at LEP II by direct reconstruction of 
W's: Tro (W) = 2.150 + 0.091GeV. Higher-order electroweak corrections, to be discussed 
in Section 5, are not expected to play a major role here. This agreement means, among 
other things, that we are not missing a significant channel to which the charged weak 
current can couple below the mass of the W. 


3.5 W pair production 


We shall outline a calculation (Quigg 1983) which indicates that the weak interactions 
cannot possibly be complete if described only by charged-current interactions. We consider 
the process z,(q) + %(¢') — W'* (k) + W- (k') due to exchange of an electron e^ with 
momentum p. The matrix element is 


GEM 


MOA 
v2 


(q) PRY 4 f (yu(g) (90) 


For a longitudinally polarised W*, this matrix element grows in an unacceptable fashion 
for high energy. Ín fact, an inelastic amplitude for any given partial wave has to be 
bounded, whereas M^ will not be. 


The polarisation vector for a longitudinal W* traveling along the z axis is 
e? (k) = (|k|,0,0, Mw) e k,/Mw , (91) 


with a correction which vanishes as |k| — oo. Replacing €" (k) by k,/Mw, using K =d- f 
and £u(g) = 0, we find 


MOA) w — 2G Mwi(g?) P (K)u(g), (92) 


MOP = 2Gp May 8g - 0g 09 — 4g - g'e £09]. (93) 


lepton pol. 


This quantity contributes only to the lowest two partial waves, and grows without bound 
as the energy increases. Such behavior is not only unacceptable on general grounds 


26 Jonathan L Rosner 


because of the boundedness of inelastic amplitudes, but it leads to divergences in higher- 
order perturbation contributions, e.g., to elastic vv scattering. 


Two possible contenders for a solution of the problem in the early 1970s were (1) 
a neutral gauge boson Z° coupling to vy and W*^W- (Glashow 1961, Weinberg 1967, 
Salam 1968), or (2) a left-handed heavy lepton E+ (Georgi and Glashow 1972a) coupling 
to V,W +. Either can reduce the unacceptable high-energy behavior to a constant. The Z° 
alternative seems to be the one selected in nature. In what follows we will retrace the 
steps of the standard electroweak theory, which led to the prediction of the W and Z and 
all the phenomena associated with them. 


4  Electroweak unification 


4.1 Guidelines for symmetry 


We now return to the question of what to do with the “neutral W” (the particle we called 
W? in the previous section), a puzzle since the time of Oskar Klein in the 1930s. The 
time component of the charged weak current 


Je? = Niya LL + Ury V D , (94) 


where N; and Ly are neutral and charged lepton column vectors defined in analogy with 
U; and Dz, may be used to define operators 


QU = [era , Qe Qh (95) 


which are charge-raising and charge-lowering members of an SU(2) triplet. If we define 
Qs = (1/2)[QC?, Q7], the algebra closes: [Qs, QU] = +Q. This serves to normalise 
the weak currents, as mentioned in the Introduction. 


The form (94) (with unitary V) guarantees that the corresponding neutral current will 


be 
n oli o d - - 
Te) = 5 [De Na — Lingle + UU, - Di, Di] . (96) 
which is diagonal in neutral currents. This can only succeed, of course, if there are equal 


numbers of charged and neutral leptons, and equal numbers of quarks of charge 2/3 and 
and quarks of charge —1/3. 


It would have been possible to define an SU(2) algebra making use only of a doublet. 


(Gell-Mann and Lévy 1960) 
uj _ u l (97) 
d L Vaad + V4,5 L 


which was the basis of the Cabibbo (1963) theory of the charge-changing weak interactions 
of strange and non-strange particles. If one takes Vaa = cos 6c, Vus = sin 9c, as is assumed 
in the Cabibbo theory, the u, d, s contribution to the neutral current JO is 


Jo ads = 5 (TET — cos? Pcdr Yadi — sin? OCS LY SL 


— sin &c cos Oc(drYaS1 + $1ud1)] . (98) 
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This expression contains strangeness-changing neutral currents, leading to the expectation 
of many processes like At — atvv, K? — ptu”, ..., at levels far above those observed. 
It was the desire to banish strangeness-changing neutral currents that led Glashow et 
al. (1970) to introduce the charmed quark c (proposed earlier by several authors on the 
basis of a quark-lepton analogy) and the doublet 


C 


(99) 
Vead + Vas 


L 


In this four-quark theory, one assumes the corresponding matrix V is unitary. By suitable 
phase changes of the quarks, all elements can be made real, making V an orthogonal 
matrix with Vig = Ves = cos8c, Vas = —V.4 = sinc. Instead of (98) one then has 


l,. - j » 
JO, ase = zru + ner — dL Yudi — 51781]; (100) 


which contains no flavour-changing neutral currents. 


The charmed quark also plays a key role in higher-order charged-current interactions. 
Let us consider K®-K° mixing. The CP-conserving limit in which the eigenstates are 
K (even CP) and A» (odd CP) can be illustrated using a degenerate two-state system 
such as the first excitations of a drum head. There is no way to distinguish between 
the basis states illustrated in Figure 9(a), in which the nodal lines are at angles of +45° 
with respect to the horizontal, and those in Figure 9(b), in which they are horizontal and 
vertical. 


Y) > S 
(p k O 
Figure 9. Basis states for first excitations of a drum head. (a) Nodal lines at +45° with 
respect to horizontal; (b) horizontal and vertical nodal lines. 


If a fly lands on the drum-head at the point marked “x”, the basis (b) corresponds to 
eigenstates. One of the modes couples to the fly; the other doesn’t. The basis in (a) is like 
that of (h°, K^), while that in (b) is like that of (Kj, K2). Neutral kaons are produced 
as in (a), while they decay as in (b), with the fly analogous to the 77 state. The short- 
lived state (A, in this CP-conserving approximation) has a lifetime of 0.089ns, while 
the long-lived state (~ A2) lives ~ 600 times as long, for 52ns. Classical illustration of 
CP-violating mixing is more subtle but can be achieved as well, for instance in a rotating 
reference frame (Rosner and Slezak 2001, Kostelecky and Roberts 2001). 
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The shared z7 intermediate state and other low-energy states like 7°, y, and s are 
chiefly responsible for CP-conserving KK mixing. However, one must ensure that 
large short-distance contributions do not arise from diagrams such as those illustrated in 
Figure 10. 


S i d 
W, wW 
d j s 
Figure 10. Higher-order weak contributions to K? B mizing due to loops with internal 


u,c,t quarks. 


If the only charge 2/3 quark contributing to this process were the u quark, one would 
expect a contribution to Am, of order 


Amy |, ~ 9! fz my sin? 6c cos? 65/162? Mj, ~ Gr f mx (g^ /162?), (101) 


where f is the amplitude for d$ to be found in a K®, and the factor of 167? is character- 
istic of loop diagrams. This is far too large, since Amy ~ ly, ^» G2. fz mi. However, the 
introduction of the charmed quark, coupling to —dsin 0c + s cos ôç, cancels the leading 
contribution, leading to an additional factor of [(m? — m2)/M?,] ln( Mij/ m) in the above 
expression. Using such arguments Glashow et al. (1970) and Gaillard and Lee (1974) 
estimated the mass of the charmed quark to be less than several GeV. (Indeed, early 
candidates for charmed particles had been seen by Niu, Mikumo, and Maeda 1971.) The 
discovery of the J/# (Aubert et al. 1974, Augustin et al. 1974) confirmed this prediction; 
charmed hadrons produced in neutrino interactions (Cazzoli et al. 1975) and in e*e^ 
annihilation (Goldhaber et al. 1976, Peruzzi et al. 1976) followed soon after. 


An early motivation for charm relied on an analogy between quarks and leptons. Hara 
(1964), Maki and Ohnuki (1964), and Bjorken and Glashow (1964) inferred the existence 
of a charmed quark coupling mainly to the strange quark from the existence of the p — v, 


doublet: 
Vy c 
" |: leptons > : quarks. (102) 
m s 


Further motivation for the quark-lepton analogy was noted by Bouchiat et al. (1972), 
Georgi and Glashow (1972b), and Gross and Jackiw (1972). In a gauge theory of the 
electroweak interactions, triangle anomalies associated with graphs of the type shown in 
Figure 11 have to be avoided. This cancellation requires the fermions f in the theory to 
contribute a total of zero to the sum over f of Qi L- Such a cancellation can be achieved 
by requiring quarks and leptons to occur in complete families so that the terms 


Leptons : (0)? (;) 4 (1 (-5) - -; (103) 


ws OOE] w 
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Y 


Figure 11. Ezrample of triangle diagram for which leading behaviour must cancel in a 
renormalisable electroweak theory. 


sum to zero for each family. 


We are then left with a flavour-preserving neutral current J, given by (100), whose 
interpretation must still be given. It cannot correspond to the photon, since the photon 
couples to both left-handed and right-handed fermions. At the same time, the photon is 
somehow involved in the weak interactions associated with W exchange. In particular, 
the W+ themselves are charged, so any theory in which electromagnetic current is con- 
served must involve a yW*tW coupling. Moreover, the charge is sensitive to the third 
component of the SU(2) algebra we have just introduced. We shall refer to this as SU(2),, 
recognising that only left-handed fermions v; transform non-trivially under it. Then we 
can define a weak hypercharge Y in terms of the difference between the electric charge Q 
and the third component 73; of SU(2), (weak isospin): 

Y 
Q-hit5. (105) 


Values of Y for quarks and leptons are summarised in Table 3. 


Particle(s) 


Table 3. Values of charge, I3,, and weak hypercharge Y for quarks and leptons. 


If you find these weak hypercharge assignments mysterious, you are not alone. They 
foliow naturally in unified theories (grand unified theories) of the electroweak and strong 
interactions. A "secret formula" for Y, which may have deeper significance (Pati and 
Salam 1973), is Y = 2/35 + (B — L), where Izr is the third component of “right-handed” 
isospin, B is baryon number (1/3 for quarks), and L is lepton number (1 for leptons 
such as e^ and ve). The orthogonal component of Izr and B — L may correspond to a 
higher-mass, as-yet-unseen vector boson, an example of what is called a Z'. The search 
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for Z' bosons with various properties is an ongoing topic of interest; current limits are 
quoted by the Particle Data Group (2000). 


The gauge theory of charged and neutral W’s thus must involve the photon in some 
way. It will then be necessary, in order to respect the formula (105), to introduce an 
additional U(1) symmetry associated with weak hypercharge. The combined electroweak 
gauge group will have the form SU(2); & U(1)y. 


4.2 Symmetry breaking 


Any unified theory of the weak and electromagnetic interactions must be broken, since 
the photon is massless while the W bosons (at least) are not. Àn explicit mass term in 
a gauge theory of the form m? Ai Alt violates gauge invariance. It is not invariant under 
the replacement (26). Another means must be found to introduce a mass. The symmetry 
must be broken in such a way as to preserve gauge invariance. 


A further manifestation of symmetry breaking is the presence of fermion mass terms. 
Any product y% may be written as 


VV = (V, + Va)(Ur + Ua) = ViUn + VaL. (106) 


using the fact that Y, = V(1-- 35)/2 and Yp = v(1— y;)/2. Since v; transforms as an 
SU(2), doublet but ig as an SU(2); singlet, a mass term proportional to v: transforms 
as an overall SU(2); doublet. Moreover, the weak hypercharges of left-handed fermions 
and their right-handed counterparts are different. Hence one cannot even have explicit 
fermion mass terms in the Lagrangian and hope to preserve local gauge invariance. 


One way to generate a fermion mass without explicitly violating gauge invariance is 
to assume the existence of a complex scalar SU(2), doublet ¢ coupled to fermions via a 
Yukawa interaction: 


— or 
Ly = —gy (V 5 Og + h.c.), o= | " | . (107) 


Thus, for example, with Y; = (i.e), and wr = eg, we have 
Lye = —gye(Venó* ent EL er the.). (108) 


If à? acquires a vacuum expectation value, (¢°) #4 0, this quantity will automatically 
break SU(2) ; and U(1)y, and will give rise to a non-zero electron mass. A neutrino mass 
is not generated, simply because no right-handed neutrino has been assumed to exist. 
(We shall see in the last section how to generate the tiny neutrino masses that appear 
to be present in nature.) The gauge symmetry is not broken in the Lagrangian, but 
only in the solution. This is similar to the way in which rotational invariance is broken 
in a ferromagnet, where the fundamental interactions are rotationally invariant but the 
ground-state solution has a preferred direction along which the spins are aligned. 


The d quark masses are generated by similar couplings involving Yy; = (ü.d); and 
Vg = dg, so that 


Lyda = —gvaltizo'dp + did dg + h.c.) . (109) 
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To generate u quark masses one must either use the multiplet 


40 
b= | n | -drQ*, (110) 


which also transforms as an SU(2) doublet, or a separate doublet of scalar fields 
g! 
¢ = | E (111) 
With Yọ; = (ü,d); and Yr = ug, we then find 
Lyn = —gyu (üp u; — dió uL T h.c.) (112) 
if we make use of ó, or 
Lyu =—gyu (ro ur + dr uz + h.c.) (113) 


if we use $'. For present purposes we shall assume the existence of a single complex 
doublet, though many theories (notably, some grand unified theories or supersymmetry) 
require more than one. 


4.3 Scalar fields and the Higgs mechanism 


Suppose a complex scalar field of the form (107) is described by a Lagrangian 


À 2, t 
Ly = (0,9) (0*9) — FO + ote. (114) 


Note the “wrong” sign of the mass term. This Lagrangian is invariant under the symmetry 
SU(2), & U(1)y. The field ó will acquire a constant vacuum expectation value which we 
calculate by asking for the stationary value of Lg: 


Oks _ tg) = 


We still have not specified which component of ¢ acquires the vacuum expectation value. 
At this point only oté = |t|? + |6°|? is fixed, and (Re dt, Im ¢t, Re @°, Im 6°) can 
range over the surface of a four-dimensional sphere. The Lagrangian (114) is, in fact, 
invariant under rotations of this four-dimensional sphere, a group SO(4) isomorphic to 
SU(2) & U(1). A lower-dimensional analogue of this surface would be the bottom of a 
wine bottle along which a marble rolls freely in an orbit a fixed distance from the centre. 


Let us define the vacuum expectation value of ó to be a real parameter in the $^ 
direction: 


0 
(6) = pA . (116) 


The factor of 1//2 is introduced for later convenience. We then find, from the discussion 
in the previous section, that Yukawa couplings of ó to fermions wy; generate mass terms 
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m = gviv/ V2. We must now see what such vacuum expectation values do to gauge 
boson masses. (For numerous illustrations of this phenomenon in simple field-theoretical 
models see Abers and Lee 1973, Quigg 1983, and Peskin and Schroeder 1995.) 


In order to introduce gauge interactions with the scalar field ¢, one must replace 0, 
by D, in the kinetic term of the Lagrangian (114). Here 


iu: 


Wi y 
D, = 9, -ig— E — ig By, (117) 


where the U(1)y interaction is characterised by a coupling constant g’ and a gauge field 
B,, and we have written g for the SU(2) coupling discussed earlier. It will be convenient 
to write ¢ in terms of four independent real fields (£*, 7) in a slightly different form: 


sci i 0 
$ = exp (& ) vty |: (118) 


v2 


We then perform an SU(2), gauge transformation to remove the € dependence of $, and 
rewrite it as 


0 


v 7] 
v2 


The fermion and gauge fields are transformed accordingly; we rewrite the Lagraugian for 
them in the new gauge. The resulting kinetic term for the scalar fields, taking account 
that Y = 1 for the Higgs field (107), is 


Lre = (D.o) (D"9) 


ig 


'T his term contains several contributions. 


o= (119) 


2 


(120) 


3 
Witiw? -W? 


wi Mae |= | 
Hu 


1. There is a kinetic term 1(8,m) (2n) for the scalar field 7. 
2. A term v, is a total divergence and can be neglected. 
3. There are WW, BBn, WW y^, and BBr? interactions. 


4. The v? term leads to a mass term for the Yang-Mills fields: 


v? , 
Laym = y [TV + (QW? Y] + (QW? - g B]. (121) 
The spontaneous breaking of the SU(2) & U(1) symmetry thus has led to the appearance 
of a mass term for the gauge fields. This is an example of the Higgs mechanism (Higgs 
1964). An unavoidable consequence is the appearance of the scalar field 7, the Higgs field. 
We shall discuss it further in Section 5. 


The masses of the charged W bosons may be identified by comparing (121) and (75): 


(gu)?/8 = M2,/2, | or My =gv/2. (122) 
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Since the Fermi constant is related to g/My, one finds 


Cro 21 , o0 v =D G; = 246 GeV. (123) 
V2 8Mi; w 
The combination gW? — g' B, also acquires a mass. We must normalise this com- 
bination suitably so that it contributes properly in the kinetic term for the Yang-Mills 
fields: 


1 i vi 1 y 
£kyu o 7 WW" — B, BY, (124) 
where 
Wi, = 3 Wi — OWi + gei WIWE, ^ B,,59,B, - 0,B,. (125) 
Setting 
cos 6 = I sind = — I (126) 


(fe gym (fe gym 
we may write the normalised combination ~ gw} — g B, which acquires a mass as 
Z, = W{ cos0 — B, sin£. (127) 
The orthogonal combination does not acquire a mass. It may then be identified as the 
photon: 
A, — B, cos 8 + W? sin. (128) 
The mass of the Z is given by 
(g? +g’)? _ M3 Mw(g 97)? _ Mw 
=, OI Mg = ———————— = ; 
8 2 g cos 0 
using (126) in the last relation. The W’s and Z's have acquired masses, but they are not 


equal unless g' were to vanish. We shall see in the next subsection that both g and g’ are 
nonzero, so one expects the Z to be heavier than the W. 


(129) 


It is interesting to stop for a moment to consider what has taken place. We started 
with four scalar fields +, $7, ¢°, and ¢°. Three of them [ó*, $7, and the combination 
(99 — 99)/iv/2] could be absorbed in the gauge transformation in passing from (118) to 
(119), which made sense only as long as (¢° + ġ)/ v2 had a vacuum expectation value v. 
The net result was the generation of mass for three gauge bosons W+, W`, and Z. 


If we had not transformed away the three components £* of ¢ in (118), the term £x 
in the presence of gauge fields would have contained contributions W,O^ó which mixed 
gauge fields and derivatives of ¢. These can be expressed as 

W,*9 = 0^(W,ó) — (^W, )ó (130) 
and the total divergence (the first term) discarded. One thus sees that such terms miz 
longitudinal components of gauge fields (proportional to O"W,) with scalar fields. It is 
necessary to redefine the gauge fields by means of a gauge transformation to get rid of 
such mixing terms. It is just this transformation that was anticipated in passing from 
(118) to (119). 

The three “unphysical” scalar fields provide the necessary longitudinal degrees of free- 
dom in order to convert the massless W+ and Z to massive fields. Each massless field 
possesses only two polarisation states (J, = +J), while a massive vector field has three 
(J, = 0 as well). Such counting rules are extremely useful when more than one Higgs field 
is present, to keep track of how many scalar fields survive being “eaten” by gauge fields. 
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4.4 Interactions in the SU(2) ® U(1) theory. 


By introducing gauge boson masses via the Higgs mechanism, and letting the simplest 
non-trivial representation of scalar fields acquire a vacuum expectation value v, we have 
related the Fermi coupling constant to v, and the gauge boson masses to gv or (g?4-9g 7) /?v. 
We still have two arbitrary couplings g and g’ in the theory, however. We shall show how 
to relate the electromagnetic coupling to them, and how to measure them separately. 


The interaction of the fermions with the gauge fields is described by the kinetic term 
£k — V Dy. Here, as usual, 
. ri i . QY 
p=- f -igt p. (131) 
The charged-W interactions have already been discussed. They are described by the terms 
(76) for leptons and (80) for quarks. The interactions of W? and B may be re-expressed 
in terms of A and Z via the inverse of (127) and (128): 


W3-Z,cosÜ--A,sinü, ^ B,-— —Z,sin6 + A, cosÓ. (132) 
Then the covariant derivative for neutral gauge bosons is 
Dau =P — iglaL( Z cos0-- Asin0) — ig (Q — IL )\(— Zsin0-- Acos). (133) 


Here we have substituted Y/2 = (Q — Iz). We identify the electromagnetic contribution 
to the right-hand side of (133) with the familiar one —ieQ A, so that 


e = g' cosÓ = gsin6. (134) 


The second equality, stemming from the demand that I3, A terms cancel one another 
in (133), is automatically satisfied as a result of the definition (126). Combining (126) 
and (134), we find 


! 1 1 1 
99 or z5. (135) 


the result advertised in the Introduction. 


e = 


The interaction of the Z with fermions may be determined from Eq. (133) with the 
help of (126), noting that g cos 0 + g' sin 8 = (g? + g°)!/? and g' sin = (g? + 9)? sin? 0. 
We find 

Duca =P — ieQ A — ig? + g^) ^ (UIs, — Qsin' 0) Z. (136) 


Knowledge of the weak mixing angle 0 will allow us to predict the W and Z masses. 
Using Gr /V2 = g?/8Mj, and gsinÓ = e, we can write 


1/2 
1 3 GeV 
TO | _ 37.3 Ge (137) 


VIG r 


if we were to use a^! = 137.036. However, we shall see in the next section that it is 
more appropriate to use a value of a^! ~ 129 at momentum transfers characteristic of 
the W mass. With this and other electroweak radiative corrections, the correct estimate 
is raised to Mw c 38.6 GeV/sin9, leading to the successful predictions (7). The Z mass 
is expressed in terms of the W mass by Mz = Mw/ cos 0. 


sin@  sinó 
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4.5 Neutral current processes 


The interactions of Z's with matter, 
Linz = (9? + 9^) Ua, — Qsin? 0) ZY, (138) 


may be taken to second order in perturbation theory, leading to an effective four-fermion 
theory for momentum transfers much smaller than the Z mass. In analogy with the rela- 
tion between the W boson interaction terms (76) and (80) and the four-fermion charged- 
current interaction (71), we may write 


HAE = 4Gr v2 [V (Ign — Q sin? 0) 7" vo] [Us (Jay — Q sin? 8)y, vA]. (139) 


where we have used the identity (g? + 97)/8M2 = Gr / V2 following from relations in the 
previous subsection. 


Many processes are sensitive to the neutral-current interaction (139), but no evidence 
for this interaction had been demonstrated until the discovery in 1973 of neutral-current 
interactions on hadronic targets of deeply inelastically scattered neutrinos (Hasert et 
al. 1973; Benvenuti et al. 1974). For many years these processes provided the most 
sensitive measurement of neutral-current parameters. Other crucial experiments (see, 
e.g., reviews by Amaldi et al. 1987 and Langacker et al. 1992) included polarised electron 
or muon scattering on nucleons, asymmetries and total cross sections in ete — pt po 
or rtr, parity violation in atomic transitions, neutrino-electron scattering, coherent 7° 
production on nuclei by neutrinos, and detailed measurements of W and Z properties. 
Let us take as an example the scattering of leptons on quarks to see how they provide a 
value of sin?0. In the next subsection we shall turn to the properties of the Z bosons, 
which are now the source of the most precise information. 


One measures quantities 


c(vÀ — v4...) oA...) 
= a 4. R; = T . 
Ry c(vA 3 u- t+...) (pA > ut +...) (140) 


These ratios may be calculated in terms of the weak Hamiltonians (71) and (139). It is 
helpful to note that for states of the same helicity (L or R, standing for left-handed or 
right-handed) scattering on one another, the differential cross section is a constant: 
do ( 
dQ 


where go is some reference cross section, while for states of opposite helicity, 


RR > RR) = (LL LL) = = (141) 


da do oo {1+ cos bem 2 
Ll = — ER) = — | — . 
dg (RL > RL) ig (LF > LR) dz ( 5 (142) 
Thus 
c(RR > RR) = c(LL > LL) = 30(RL > RL) = 30(LR- LR). (143) 


We first simplify the calculation by assuming the numbers of protons and neutrons are 
equal in the target nucleus, and neglecting the effect of antiquarks in the nucleon. (We 
shall use the shorthand v = v, and v = v,.) Then 


a(vu — vu) + c(vd > vd) 
a(vd > pu) 


R= a(vu > vu) + c(vd > vd) (144) 


R,= 
a(vu > ptd) 
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One can write the effective Hamiltonian (139) in the form 


HNC = ^ [Po (1 — 55)v] [a^ (1 — 9)uez (u) + à (1 + ys) uen(u) 


+ dy"(1 — ys)der(d) + d (1--5s)den(d) , — (145) 
where 
elu) = l 2 dnt (u) = 2 ain? 9 146 
u(u)= 5-38 ; erlu) = =z sin" 6, (146) 
eld) =-} + lane (d) = 1 gin? 0 147 
id) = 75 35m ; €R = 3m : (147) 
Taking account of the relations (143), one finds 
1 1 
R, = [e (P + 3lex Cor tear + alex or ; (148) 
Ry = [ex(u)}? + 3fen(u)? + fer(d)P +3fer(a)?, (149) 
where we have used the fact that o(vd — u^ d) = 3e(vu — pt d). The results are 
ol 3 20 ., _1 9 20 ., 
R,-5 sin O+ y: sin 8, Ry = 5 — sin 6+ sin 8. (150) 


If we consider also the antiquark content of nucleons, this result may be generalised 


(Llewellyn Smith 1983) by defining 


vN tX 
pa OND X) (151) 
a(vN — u- X) 
Instead of (150) one then finds 
l 225,9 . 1 . 5 1 
„= 02 4 Ql -sni642 lante. 
R 5 sin + g(1 +1) sin 6, R ; sin +z + —)sin 0 (132) 


Some experimental values of R,, Rz, and r are shown in Table 4 (Conrad et al. 1998). 
The relation between R, and R; as a function of sin? 0 is plotted in Figure 12. This result 
has a couple of interesting features. 


Experiment R, Rz | r 
CHARM | 0.3091 + 0.0031 | 0.390 + 0.014 | 0.456 + 0.011 
CDHS 0.3135 + 0.0033 | 0.376 + 0.016 | 0.409 + 0.014 
Average | 0.3113 + 0.0023 | 0.384 + 0.011 | 0.429 + 0.011 


Table 4. Neutrino neutral-current parameters. 


The observed E; is very close to its minimum possible value of less than 0.4. Initially 
this made the observation of neutral currents quite challenging. Note that R, is even 
smaller. Its value provides the greatest sensitivity to sin? 0. It is also more precisely 
measured than R; (in part, because neutrino beams are easier to achieve than antineutrino 
beams). The effect of r on the determination of sin? @ is relatively mild. 
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Figure 12. The Weinberg-Salam “nose” depicting the relation between R, and Rj. 
The solid line corresponds to r = 0.429, close to the actual situation; the dashed line 
corresponds to the idealised case r = 1/3 in which antiquarks in the nucleon are neglected. 
The plotted point with error bars corresponds to the average of measured values. 


A recent determination of sin? 0 (Zeller et al. 1999), based on a method proposed by 
Paschos and Wolfenstein (1973), makes use of the ratio 


R= a(vN > vX) — c(vN > PX) _ RH,—- rH; _ 1 L sin? (153) 

ofo N > u- X) - c(v N > u* X) l-r 2 
In these differences of neutrino and antineutrino cross sections, effects of virtual quark- 
antiquark pairs in the nucleon (“sea quarks,” as opposed to “valence quarks”) cancel one 
another, and an important systematic error associated with heavy quark production (as 


in vs 3 uc) is greatly reduced. The result is 
sin? 9 (en-*hel — 0,2253 + 0.0019(stat.) + 0.0010(syst.) . (154) 


which implies a W mass 
My = Mz cos0 =h) — 80.21 +0.11 GeV. (155) 


The “on-shell” designation for sin? 0 is necessary when discussing higher-order electroweak 
radiative corrections, which we shall do in the next section. 
A more recent analysis by Zeller et al. (2001) finds 
sin? 9 (on-shel) — 0,2277 + 0.0014(stat..) + 0.0008(syst.) . (156) 


equivalent to My = 80.136 + 0.084GeV. Incorporation of this result into the electroweak 
fits described in the next section is likely to somewhat relax constraints on the Higgs 
boson mass: See Rosner (2001). 
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4.6 Z and top quark properties 


We have already noted the prediction and measurement of the W mass and width. The Z 
mass and width are very precisely determined by studying the shape of the cross section 
for electron-positron annihilation as one varies the energy across the Z pole. The results 


(LEP Electroweak Working Group [LEP EWWG] 2001) are 
Mz = 91.1875 + 0.0021 GeV, Dz = 2.4952 + 0.0023 GeV. (157) 


In much of the subsequent discussion we shall make use of the very precise value of Mz as 
one of our inputs to the electroweak theory; the two others, which will suffice to specify 
all parameters at lowest order of perturbation theory, will be the Fermi coupling constant 
Gp = 1.16637(1) x 10-° GeV? and the electromagnetic fine-structure constant, evolved 
to a scale M2: a^! (MS) ( M2) = 128.933 + 0.021 (Davier and Hocker 1998). This last 
quantity depends for its determination upon a precise evaluation of hadronic contributions 
to vacuum polarisation, and is very much the subject of current discussion. 


The relative branching fractions of the Z to various final states may be calculated on 
the basis of (138). One may write this expression as 


Lzy — (9 tg)? f. Zl. —rs)ar (19 3s)an]f - (138) 
The values of a; and ag for each fermion are shown in Table 5. 
| Channel aL aR Partial Number of | Subtotal 
width (MeV) | channels (MeV) 
vy 1 0 165.9 3 498 
il 1 (—4 + sin?) ‘sin? @ 83.4 3 250 
uü, cé | P(i-$sm'8) | i (-3sin?6) 296.5 2 593 
dd, ss | i(-i-ism'0)| i(isim6) 382.1 2 764 
LEE: (-3+5sin’9) | i(ism?0) | 3728 1 373 
L Total 2478 


Table 5. Contributions to Uz predicted in lowest-order electroweak theory (including 
leading-order QCD corrections to hadronic channels). Here we have taken sin? 0 — 0.231 


and as(M2) = 0.12. 


The partial width of Z into ff is 
x 4Gr 
r(Z > ff)= 3a s, 5 M a; +ap)Ne, (189) 
where n, is the number of colours of fermions f: 1 for leptons, 3 for quarks. 


The predicted partial width for each Z — vv channel is independent of sin? 6: 


Gr Mj 


Vi 519; = 165.9 MeV (160) 


T'(Z > vv) = 
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using the observed value of Mz. The partial decay rates to other channels are expected 
to be in the ratios i 
vD: ete: uu: dd= 


32 8 
1:1-— 4sin?0 -- 8sin*0 : 3 — 8sin^ 0+ — sin’ 6 : 3 — 4sin 6 +z sin 8, (161) 


or 1: 0.503: 1.721: 2.218 for sin? 0 — 0.231. A small kinematic correction for the non-zero 
b quark mass leads to a suppression factor 


4m? 1/2 2m? 4m? 
ð; —(1-—2 142) + 1— b 
bb ( i) fv M fa M2]]|' (162) 


where fy and f4 = 1— fy are the relative fractions of the partial decay width proceeding 
via the vector (~ ar-J-ag) and axial-vector (~ arj—ag) couplings. For sin?@ = 0.23, 
fv = 1/3, fa ~= 2/3, and y ~ 0.988. A further correction to I'(Z — bb), important for 
the precise determinations in the next section, is associated with loop graphs associated 
with top quark exchange (see the review by Chivukula 1995), and is of the same size, 
about 0.988. Taking a correction factor (1 + as/z) with as(M2) = 0.12 for the hadronic 
partial widths of the Z, we then predict the contributions to Tz listed in Table 5. (The 
tt channel is, of course, kinematically forbidden.) 


The measured Z width (157) is in qualitative agreement with the prediction, but 
above it by about 0.7%. This effect is a signal of higher-order electroweak radiative 
corrections such as loop diagrams involving the top quark and the Higgs boson. Similarly, 
the observed value of T (Z — ete7), assuming lepton universality, is 83.984 + 0.086 MeV, 
again higher by 0.7% than the predicted value of 83.4MeV. We shall return to these effects 
in the next section. 


The width of the Z is sensitive to additional vr pairs. Clearly there is no room for an 
additional light pair coupling with full strength. Taking account of all precision data and 
electroweak corrections, the latest determination of the "invisible" width of the Z (see 
the compilations by the LEP EWWG 2001 and by Langacker 2001) fixes the number of 
“light” neutrino species as N, = 2.984 + 0.008. 


The Z is produced copiously in ete~ annihilation when the center-of-mass energy ys 


is tuned to Mz. The Stanford Linear Collider (SLC) and the Large Electron-Positron 
Collider at CERN (LEP) exploited this feature. The cross section of production of a final 
state f near the resonance, ignoring the effect of the virtual photon in the direct channel, 
should be 

T(Z 2 ete" )T'(Z > f) 

(s - Mz)? + Mi 

Setting B.+.- = (Z9 — e*e-)/Tz e 3.3796 we see that at resonance the peak total cross 
section should be aya = 127B.+e- / M2 ~ 59.4 nb, corresponding to 


olete — Z? — f) = 120 


(163) 


c(e*e- = Z? — all) JL 9Bete- 
2 


Hy = ~ 5000. (164) 


olete > ntu-) — a 
This is a spectacular value of R, which is only a few units in the range of lower-energy 
ete colliders. Of course, not all of the cross section at the Z peak is visible: Nearly 12 nb 
goes into neutrinos! Another 6 nb goes into charged lepton pairs, leaving Opeak, hadrons = 


41.541 + 0.037 nb. 
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We close this subsection with a brief discussion of spin-dependent asymmetries at 
the Z. These are some of the most powerful sources of information on sin? @. They 
have been measured both at LEP (through forward-backward asymmetries) and at SLC 
(through the use of polarised electron beams). 


The discussion makes use of an elementary feature of vector- and axial-vector cou- 
plings. Processes involving such couplings to a real or virtual particle (such as the Z) 
always conserve chirality. In the direct-channel reactions eet — Z — ff this means 
that a left- (right-)handed electron only interacts with a right- (left-) handed positron, 
and if the final fermion f is left- (right-)handed then the final anti-fermion f will be right- 
(left-) handed. Moreover, such reactions have characteristic angular distributions, with 


drei 1 pap (EER) ats 
merat ay (MEE) aus 
de Fi = oofa)? (ah) (icy ; (167) 
wien > f) = ea) (af)? ee) (168) 


where gg is some common factor, and the az pg are given in Table 5. Several asymmetries 
can be formed using these results. 


The polarised electron left-right asymmetry compares the cross sections for producing 
fermions using right-handed and left-handed polarised electrons, as can be produced and 
monitored at the SLC. The cross section asymmetry is given by 


Af g(hadrons) 


a(eze* — hadrons) — e(ege* — hadrons) 
o(e,e+ — hadrons) + e(ege* — hadrons) 
(a)? - (aS)? _ 1 — 4sin?8 

(a2)? + (a%)?  1— 4sin? 8 - 8sin* 0" 


(169) 


The measured value (LEP EWWG 2001) A7 &(hadrons) = 0.1514: 0.0022 corresponds to 
sin? à = 0.23105 + 0.00028 using this formula. (We shall discuss small corrections in the 
next section.) 


The forward-backward asymmetry in ete~ — ff uses the fact that 


1 70 1 0 8 
— 2 2— - 
(f J) 408 8(1 + coso) +2, (f + ff.) dos 0(1 + cos) 5, 070 


so that 
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These quantities can be measured not only for charged leptons, but also for quarks 
such as the 6, whose decays allow for a distinction to be made (at least on a statistical 
basis) between 6 and b. 


The discovery of the top quark by the CDF (1994) and DO (1995) Collaborations cul- 
minated nearly two decades of detector and machine work at the Fermilab Tevatron. A 
ring of superconducting magnets was added to the 400GeV Fermilab accelerator, more 
than doubling its energy. Low-energy rings were added to accumulate and store antipro- 
tons, which were then injected into the superconducting ring and made to collide with 
oppositely-directed protons at a centre-of-mass energy of 1.8TeV. The top quarks were 
produced in the reaction pp > tt 4- .... 


The top quark's mass is currently measured to be m, = 174.3 + 5.1 GeV. It cou- 
ples mainly to b, as expected in the pattern of couplings discussed in Section 3. One 
determination (see Gilman, Kleinknecht, and Renk 2000 for details) is that 

IVa 
IVa]? + [Vis]? + Viel? 
This result makes use of the measured fraction of the decays t — bet v, in top semileptonic 
decays. 


= 0.99 + 0.29. (172) 


The top quark is the only quark heavy enough to decay directly to another quark 
(mainly 5) and a real W. Its decay width is 


Grm M&WV Mà 
T(t > W*b)- AN, (: - 1) (12) ~ 1.53 GeV. (173) 


This is larger than the typical spacing between quarkonium levels (see Figures 4 and 5). 
and so there is not expected to be a rich spectroscopy of ff levels, but only a inild 
enhancement near threshold of the reaction e*e^ — t£, associated with the production 
of the 1S level (Kwong 1991, Strassler and Peskin 1991). A good review of present and 
anticipated top quark physics is given by Willenbrock (2000). 


5 Higgs boson and beyond 


5.1 Searches for a standard Higgs boson 


Let us assume that all quark and lepton masses and all W and Z masses arise froin the 
vacuum expectation value of a single Higgs boson: (9°) = v/V/2. where the strength of 
the Fermi coupling requires v = 246GeV. The Yukawa coupling gy ; (107) for a fermion f 
is related to the fermion's mass: gy ; = Vm; /v. (It is a curious feature of the top quark's 
mass that, within present errors, gy; = 1. Since fermion masses “run” with scale p, it is 
not clear how fundamental this relation is.) Those quarks with the greatest mass then 
are expected to have the greatest coupling to the physical Higgs boson H = /2¢° — v. 
(Here we use H to denote the field represented by 7 in the previous section.) 


The Higgs boson has a well-defined coupling to W's and Z's as a result of the discussion 
in the previous section. The term (D,,¢)'(D“¢) in the Lagrangian leads to 


(9? - 9?) ^? HMz 
2 


Cuww —- gHMw(W;W"*), — £uzz— (Z,Z"). (174) 
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To lowest order, one find Lyzy = Luy = 0. 


Processes involving the couplings (174) include qd — Wu — W +H and especially 


+,- 7 
ee >o Zreal or virtual — Zyirtual or real + H , (1 75) 


where the final Z^ can be detected (for example) via its decay to e*e^, ut u^, or even its 
existence inferred from its vi decay. For a virtual intermediate and real final Z, the cross 
section (Quigg 1983) is 
ra? (p? +3M2) 2p* 
te’ + ZH)- A 1 — 4sin? 6 + 8sint 6) —— , 1 

o(e*e ) Fan boost (ME y ( sin’ 6 + 8sin ) Vs (176) 
where p* is the final c.m. 3-momentum. This cross section behaves as 1/s for large s (as 
does any cross section for production of qd, p^ p^ ,...), so that as s > oc, 


a(ete > ZH) 1—4sin? 0 +8sinfð 1 
o(ete- > ptu) 128sin'Ócos!ü 8^ 


(177) 


At very high energies, the Higgs boson can be produced by means of WtW- and ZZ 
fusion; the (virtual) W's and Z's can be produced in either hadron-hadron or lepton-lepton 
collisions. A further proposal for producing Higgs bosons is by means of muon-muon 
collisions. 


For Higgs bosons far above WW and ZZ threshold, one expects (Eichten et al. 1984) 


3G. 3 


1617/2 


as one can show with the help of (174). The longitudinal degrees of freedom of the W and 
Z provide the dominant contribution to the decay width in this limit. For My = 1TeV, 
this relation implies that the Higgs boson’s width will be nearly 500GeV. Such a broad 
object will be difficult to separate from background. However, mixed signals for a a much 
lighter Higgs boson have already been received at LEP. 


At the very highest LEP energies attained, ys < 209 GeV, the four LEP collabo- 
rations ALEPH, DELPHI, L3, and OPAL have presented combined results (LEP Higgs 
Working Group 2001) which may be interpreted either as a lower limit on the Higgs bo- 
son mass of 114.1GeV, or as a weak signal for a Higgs boson of mass My ~ 115.6GeV 
produced by the above process. This latter interpretation is driven in large part by the 
ALEPH data sample (Barate et al. 2001). The main decay mode of a Higgs boson in this 
mass range is expected to be bb, with rtr- taking second place. 


Ty =[(H > W'W-)4T(H2ZZ)- (178) 


M 
3. , H 
Mp = 60 GeV (x Gv) 


LEP now has ceased operation in order to make way for the Large Hadron Collider 
(LHC), which will collide 7TeV protons with 7TeV protons and should have no problem 
producing such a boson. The LHC is scheduled to begin operation in 2006. In the 
meantime, the Fermilab Tevatron has resumed pj collider operation after a hiatus of 
5 years. Its scheduled “Run IF" is initially envisioned to provide an integrated luminosity 
of 2 fb~!, which is thought to be sufficient to rival the sensitivity of the LEP search (Carena 
et al. 2000), making use of the subprocess qd — Wirtu > W + H. With 10 fb^! per 
detector, a benchmark goal for several years of running with luminosity improvements, it 
should be possible to exclude a Higgs boson with standard couplings nearly up to the ZZ 
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threshold of 182GeV, and to see a 3o signal if My < 125 GeV. Other scenarios, including 
the potential for discovering the Higgs boson(s) of the Minimal Supersymmetric Standard 
Model (MSSM) are given by Carena et al. (2000). Meanwhile, we shall turn to the wealth 
of precise measurements of electroweak properties of the Z, W, top quark, and lighter 
fermions as indirect sources of information about the Higgs boson and other new physics. 


5.2 Precision electroweak tests 


We have calculated processes to lowest electroweak order in the previous section, with the 
exception that we took account of vacuum polarisation in the photon propagator, which 
leads to a value of o^! closer to 129 than to 137.037 at the mass scale of the Z. The lowest- 
order description was found to be adequate at the percent level, but many electroweak 
measurements are now an order of magnitude more precise. As one example, we found that 
the predicted total and leptonic Z widths both fell short of the corresponding experimental 
values by about 0.7%. Higher-order electroweak corrections are needed to match the 
precision of the new data. These corrections can shed fascinating light on new physics, 
as well as validating the original motivation for the electroweak theory (which was to be 
able to perform higher-order calculations). 


We shall describe a language introduced by Peskin and Takeuchi (1990) for precise 
electroweak tests which allows the constraints associated with nearly every observable to 
be displayed on a two-dimensional plot. The Standard Model implies a particular locus 
on this plot for every value of m, and My, so one can see how observables can vary 
with m, (not much, now that m, is so well measured) and My. Moreover, one can spot 
at a glance if a particular measurement is at variance with others; this can either signify 
physics outside the purview of the two-dimensional plot, or systematic experimental error. 


The corrections which fall naturally into the two-dimensional description are those 
known as oblique corrections. The name stems from the fact that they do not directly affect 
the fermions participating in the processes of interest, but appear as vacuum polarisation 
corrections in gauge boson propagators. In that sense processes which are sensitive to 
oblique corrections have a broad reach for discovering new physics, since they do not rely 
on a new particle's having to couple directly to the external fermion in question. 


The oblique correction first identified by Veltman (1977), still the most important, is 
that due to top quarks in W and Z boson propagators. The large splitting between the 
top and bottom quarks’ masses violates a custodial SU(2) symmetry (Sikivie et al. 1980) 
responsible for preserving the tree-level relation My = Mz cos 0 mentioned in the previous 
section. As a result, an effect is generated which is equivalent to having a Higgs triplet 
vacuum expectation value. 

For the photon, gauge invariance prohibits contributions quadratic in fermion masses, 
but for the W and Z, no such prohibition applies. The vacuum polarisation diagrams with 
Wt — tb > Wt and Z 9 (tt, bb) — Z lead to a modification of the relation between 
Gr, coupling constants, and Mz for neutral-current exchanges: 


Gr _ P +g’ GF g x97 ~ 1 4 Crm 
J/à 8M} 27 gM ^ "^ "sey. 
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The Z mass is now related to the weak mixing angle by 


na 


= V2Grpsin? 8 cos? 8 ' 


MZ (180) 
where we have omitted some small terms logarithmic in m,. À precise measurement of Mz 
now specifies 0 only if m, is known, so 6 = 6(m,) and hence Mj, = no/(v2G r sin? 0) is 
also a function of m4. 


The factor of p in (179) will multiply every neutral-current four-fermion interaction in 
the electroweak theory. Thus, for, example, cross sections for charge-preserving interac- 
tions of neutrinos with matter will be proportional to p?, while parity-violating neutral- 
current amplitudes (to be discussed below) will be proportional to p. Partial decay widths 
of the Z, since they involve the combination g? + g?, will be proportional to p. A large 
part of the 0.7% correction mentioned previously is due to p > 1. The observed values of 
Mw/Mz = pcos and sin? 0 also are much more compatible with each other for a value 
of p exceeding 1 by about a percent. 


The W and Z propagators are also affected by virtual Higgs-boson states due to the 
couplings (174). Small corrections, logarithmic in My, affect all the observables, but 
notably p. 


In order to display dependence of electroweak observables on such quantities as the top 
quark and Higgs boson masses m, and My, we choose to expand the observables about 
“nominal” values calculated by Marciano (2000) for specific m, and My. We thereby 
bypass a discussion of “direct” radiative corrections which are independent of mi, My, 
and new particles. We isolate the dependence on m,, My, and new physics arising from 
*oblique" corrections associated with loops in the W and Z propagators. 


For m, = 174.3GeV, My = 100GeV, the measured value of Mz leads to a nominal 
expected value of sin?0,g = 0.2314. In what follows we shall interpret the effective 
value of sin? @ as that measured via leptonic vector and axial-vector couplings, namely 


sin? ber = (1/4) — [st /gÀ])- 
Defining the parameter T by Ap = oT, we find 


3 m? — (174.3 GeV)? 3 Mg 
eL 3 - n . 
167 sin? 0 M2, 8mcos?0 — 100 GeV 


(181) 


The weak mixing angle 0, the W mass, and other electroweak observables now depend on 
m, and Mg. 


The weak charge-changing and neutral-current interactions are probed under a num- 
ber of different conditions, corresponding to different values of momentum transfer. For 
example, muon decay occurs at momentum transfers small with respect to My, while the 
decay of a Z into fermion-antifermion pairs imparts a momentum of nearly Mz/2 to each 
member of the pair. Small “oblique” corrections, logarithmic in m, and My, arise from 
contributions of new particles to the photon, W, and Z propagators. Other (smaller) 
"direct? radiative corrections are important in calculating actual values of observables. 


We may then replace the lowest-order relations between G p, couplings, and masses by 


2 2 n 
Gr 9 ( 22). Grp _ g tg ( aSz ). (182) 


V2 8M} 4sin? 0 /2 8M 4sin? 0 cos? 6 
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where Sy and Sz are coefficients representing variation with momentum transfer. To- 
gether with T, they express a wide variety of electroweak observables in terms of quantities 
sensitive to new physics. (The presence of such corrections was noted quite early by Velt- 
man 1977.) The Peskin and Takeuchi (1990) variable U is equal to Sw — Sz, while 
s= Sz. 

Expressing the “new physics” effects in terms of deviations from nominal values of top 
quark and Higgs boson masses, we have the expression (181) for T, while contributions of 
Higgs bosons and of possible new fermions U and D with electromagnetic charges Qy and 
Qp to Sy and Sz, in a leading-logarithm approximation, are (Kennedy and Langacker 
1990) 


= 1 My — my 
Sz = ow In a Gevja * 22 Nc (icq e). (183) 
Sy = >h Mu AEN (1-4 I m) 184 
W 7 6x |" 100 GeV/2 c pla) (184) 


The expressions for Sy and Sz are written for doublets of fermions with Nc colours and 
my > mp > mz, while Q = (Qu + Qp)/2. The sums are taken over all doublets of new 
fermions. In the limit my = mp, one has equal contributions to Sy and Sz. For a single 
Higgs boson and a single heavy top quark, (183) and (184) become 


S 1 In Mn — 2n m 
Z ' 6z| 100 GeV/c 1743 GeV/e 
1 My my 
Sw = x [in 100 Gev/2 * 3I i743 i , (185) 


where the leading-logarithm expressions are of limited validity for My and m, far from 
their nominal values. (We shall plot contours of S and T for fixed m, and My values 
without making these approximations.) A degenerate heavy fermion doublet with N. 
colours thus contributes ASz = ASw = N,/6z. For example, in a minimal dynamical 
symmetry-breaking (*technicolour") scheme, with a single doublet of N, = 4 fermions, 
one will have AS = 2/32 ~ 0.2. This will turn out to be marginally acceptable, while 
many non-minimal schemes, with large numbers of doublets, will be seen to be ruled out. 


Many analyses of present electroweak data within the S, T' rubric are available (e.g., 
Swartz 2001). We shall present a “cartoon” version after discussing possible extensions of 
the Higgs system. Meanwhile we note briefly a topic which will not enter that discussion. 


The anomalous magnetic moment of the electron and muon have been measured ever 
more precisely. The latest measurement of the y+ (Brown et al. 2001), performed in a 
special storage ring at Brookhaven National Laboratory, gives 


-2 
Qut exp = (F). = 11 659 202(14)(6) x 107! (1.3 ppm), (186) 


The theoretical value (CPT invariance implies a,+ = ap- ) is 


Auth = 11 659 177(7) x 1079. (0.6 ppm), (187) 
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being the sum of three terms: 


au QED = 11 658 470.56(0.29) x 107! (0.025 ppm), 
Qy,weak = 19.1(0.4) x 10710 (0.03 ppm) , 
691(7) x 10-9 (0.6 ppm), 


M 


Qu, had 


where we have incorporated a recently-discovered sign change in the hadronic light-by- 
light scattering contribution (Knecht and Nyffeler 2001; Hayakawa and Kinoshita 2001). 
The difference, 

aut exp — Gut th = 25(17) x 1071, (188) 


is not yet known precisely enough to test the expected weak contribution. Results of 
analysing a larger data sample are expected shortly. 


5.3 Multiple Higgs doublets and Higgs triplets 


There are several reasons for introducing a more complicated Higgs boson spectrum. Rea- 
sons for introducing separate Higgs doublets for u-type and d-type quarks include higher 
symmetries following from attempts to unify the strong and electroweak interactions, and 
supersymmetry. We examine the simplest model with more than one Higgs doublet, in 
which a single doublet couples to d-type quarks and charged leptons, and a different dou- 
blet couples to u-type quarks. This model turns out to naturally avoid flavour-changing 
neutral currents associated with Higgs exchange (Glashow and Weinberg 1977). 


Let us denote by ¢, the Higgs boson coupling to u-type quarks and by $4 the boson 
coupling to d-type quarks and charged leptons. We let 


(bu) = v,/ V2, (a) = va/ V2. (189) 
The contribution of ¢, and ġa to W and Z masses comes from 
£x + (Dyahu) (Dba) + (Dapa) (Da) . (190) 


We find the same Ww? — B, mixing pattern as before, and in fact this pattern would remain 
the same no matter how many Higgs doublets were introduced. The parameters v, and vg 
may be related to the quantity v = 246GeV introduced earlier by v? +13 = 1, whereupon 
all previous expressions for My and Mz remain valid. One would have v? = NT for 


any number of doublets. 


The quark and lepton couplings to Higgs doublets are enhanced if there are multiple 
doublets. Since m, = gy v4/ V2 (q = u or d) and v, < v, one has larger Yukawa couplings 
than in the standard single-Higgs model. A more radical consequence, however, of multiple 
doublets in the SU(2), gauge theory is that there are not enough gauge bosons to “eat” 
all the scalar fields. In a two-doublet model, five “uneaten” scalars remain: two charged 
and three neutral. The phenomenology of these is well-described by Gunion et al. (1990). 


The prediction Mz = My / cos is specific to the assumption that only Higgs doublets 
of SU(2); exist. [SU(2), singlets which are neutral also have Y = 0, and do not affect 
W and Z masses] If triplets or higher representations of SU(2) exist, the situation is 
changed. We shall examine two cases of triplets: a complex triplet with charges (4---.--.0) 
and one with charges (+,0,-). 
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Consider first a complex triplet of the form 


t+ +1 
$= p+ , Is = 0 . (191) 
oo —1 


Since Q = Iyı + (Y/2), one must have Y = 2 for this triplet. In calculating |D,,®|? we will 
need the triplet representation for weak isospin: 


1 i 010 . 0 -1 0 
D 

l = 0 , h-—|101!, h= —|1 0 -1/. (192 
3 l NI 2 V2 ( ) 

—1 0 10 o 0 

The result, if ($9) = Vi. / 2, is that 
2 VP g r1\2 242 3 n2 

(ID,8P) = A 1 EUW? + Q*y] + (-9W? + g BY y . (193) 


The same combination of W? and B gets a mass as in the case of one or more Higgs 
doublets, simply because we assumed that it was a neutral Higgs field which acquired a 
vacuum expectation value. Electromagnetic gauge invariance remains valid; the photon 
does not acquire a mass. However, the ratio of W and Z masses is altered. In the presence 
of doublets and this type of triplet, we find 


2 2 12 
Mi, = i H-3V? i), M3 = (42) (v? AV? 1), (194) 


so the ratio p = (Mw/Mz cos8)? is no longer 1, but becomes 


v? --2V? , 


This type of Higgs boson thus leads to p « 1. 
À complex triplet 


$a! (196) 


is characterised by Y = 0. If we let (6?) = Vio/V/2, we find by an entirely similar 
calculation, that 


g . . . g? +g”? 
Mè, = qU + 4V 5) . M3 = ( 1 ) vè. (197) 
Here we predict 4V2, 
p=1+ 3 , (198) 


so this type of Higgs boson leads to p > 1. 
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We now examine a simple set of electroweak data (Rosner 2001), updating an earlier 
analysis (Rosner 1999) which may be consulted for further references. (See also Peskin 
and Wells 2001.) We omit some data which provide similar information but are less 
constraining. Thus, we take only the observed values of Mw as measured at the Fermilab 
Tevatron and LEP-II, the leptonic width of the Z, and the value of sin? &,g as measured 
in various asymmetry experiments at the Z pole in ete” collisions. We also include parity 
violation in atoms, stemming from the interference of Z and photon exchanges between 
the electrons and the nucleus. The most precise constraint at present arises from the 
measurement of the weak charge (the coherent vector coupling of the Z to the nucleus), 
Qw = p(Z — N — AZ sin? 0), in atomic cesium. The prediction Qw (Cs) = —73.19 + 0.13 
is insensitive to standard-model parameters once Mz is specified; discrepancies are good 
indications of new physics. 


The inputs, their nominal values for m, = 174.3GeV and My = 100GeV, and their 
dependences on S and T are shown in Table 6. We do not constrain the top quark mass; 
we display its effect on S and T' explicitly. Each observable specifies a pair of parallel 
lines in the S-T' plane. The leptonic width mainly constrains T; sin? 6°f provides a 
good constraint on S with some T-dependence; and My lies in between. Atomic parity 
violation experiments constrain $ with almost no T' dependence. Although the errors on 
S they entail are too large to have much impact, we include them for illustrative purposes. 
Since the slopes associated with constraints are very different, the resulting allowed region 
is an ellipse, shown in Figure 13. [Note added: Milstein and Sushkov (2001) have noted 
that a correction due to the strong nuclear field changes the central value of Qw (Cs) in 
Table 6 to ~ —72.2, while Dzuba et al. (2001) include this and further corrections to 
obtain Qw = —72.39 + 0.58.] 


Quantity Experiment Theory | 
Mw ( GeV/c?) | 80.451 + 0.033 ? 80.385 9 — 0.29S + 0.45T 
Te(Z) (MeV) | 83.984 + 0.086 ? 84.011 ® — 0.185 + 0.78T 

sin? 6f 0.23152 + 0.00017 © | 0.23140 9 + 0.003625 — 0.00258T 
Qw (Cs) —72.5 X: 0.8 —73.19 — 0.8005 — 0.007T 
Qw (Tl) —115.0 + 4.5 —116.8 — 1.175 — 0.06T 


Table 6. Electroweak observables described in fit. The superscripts refer to (a) Charlton 
(8001), (b) Marciano (2000) and (c) LEP EWWG (2001). References for atomic physics 
experiment and theory are given by Rosner (2001). 


Figure 13 also shows predictions by Peskin and Wells (2001) of the standard elec- 
troweak theory. Nearly vertical lines correspond, from left to right, to Higgs boson masses 
Mu = 100, 200, 300, 500, 1000 GeV; drooping curves correspond, from top to bottom, to 
+lo, central, and —1lo values of m; = 174.345.1 GeV. 


In the standard model, the combined constraints of electroweak observables such as 
those in Table 6 and the top quark mass favour a very light Higgs boson, with most 
analyses favouring a value of My so low that the Higgs boson should already have been 
discovered. The efficacy of a small amount of triplet symmetry breaking has recently been 
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Figure 13. Regions of 68% (inner ellipse) and 90% (outer ellipse) confidence level 
values of S and T based on the comparison of the theoretical and experimental electroweak 
observables shown in Table 6. Details are given in the tezt. 


stressed in a nice paper by Forshaw et al. (2001). It is also implied in the discussions of 
Dobrescu and Hill (1998), Collins e£ al. (2000), He et al. (2001), and Peskin (2001). 


The standard model prediction for S and T curves down quite sharply in T as My 
is increased, quickly departing from the region allowed by the fit to electroweak data. 
(Useful analytic expressions for the contribution of a Higgs boson to S and T' are given 
by Forshaw et al. 2001.) However, if a small amount of triplet symmetry breaking is 
permitted, the agreement with the electroweak fit can be restored. As an example. a 
value of V, o/v = 0.03 permits satisfactory agreement even for My = 1TeV, as shown by 
the vertical line in the Figure. 


5.4 Supersymmetry, technicolour, and alternatives 


What could lie beyond the standard model? The odds-on favourite among most theorists 
is supersymmetry, an extremely beautiful idea which may or may not be realised at the 
electroweak scale, but which almost certainly plays a role at the Planck scale at which 
space and time first acquire their meaning. 


The simplest illustration of supersymmetry (in one time and no space dimensions!) 
does back to Darboux in 1882, who factored second-order differential operators into the 
product of two first-order operators. Dirac's famous treatment of the harmonic oscillator, 
writing its Hamiltonian as H = hw(ata+ 4), is an example of this procedure, which was 
generalised by Schrodinger in 1941 and Infeld and Hull in 1951. Some of this literature 
is reviewed by Kwong and Rosner (1986). The Hamiltonian is the generator of time 
translations, so this form of supersymmetry essentially amounts to saying that a time 
translation can be expressed as a composite of more fundamental operations. 
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Modern supersymmetry envisions both spatial and time translations as belonging to a 
super-algebra. The Lorentz group is isomorphic to SU(2) & SU(2) (with factors of i thrown 
in to account for the Minkowski metric); under this group space and time translations 
transform as (1/2,1/2). The supercharges transform as (1/2,0) and (0,1/2), clearly more 
fundamental objects. 


Electroweak-scale supersymmetry is motivated by several main points. You will hear 
further details in this lecture series from Abel (2001). 


1. In any gauge theory beyond the standard SU(3)cclour® SU(2),, if the scale A of new 
physics is very high, this scale tends to make its way into the Higgs sector through 
loop diagrams, leading to quadratic contributions ~ g?A? to the Higgs boson mass. 
Unless something cancels these contributions, one has to fine-tune counter-terms in 
the Lagrangian to exquisite accuracy, at each order of perturbation theory. This is 
known as the “hierarchy problem.” 


2. The very nature of a \(¢'#)? term in the Lagrangian is problematic when considered 
from the standpoint of scale changes. This is known as the “triviality problem.” 


3. In the simplest theory by Georgi and Glashow (1974) unifying SU(3)colour@ SU(2);. 
based on the gauge group SU(5), the coupling constants approach one another at 
high scale, but there is some "astigmatism." In a non-supersymmetric model, they 
do not all come together at the same scale. This is known as the "unification 
problem." It is cured in the simplest supersymmetric model, as a result of the dif- 
ferent particle content in loop diagrams contributing to the running of the coupling 
constants. The model has a problem, however, in predicting too large a rate for 
p K*v (Murayama and Pierce 2001, Peskin 2001). 


An alternative scheme for solving these problems, which has had a much poorer time 
constructing any sort of self-consistent theory, is technicolour, the notion that the Higgs 
boson is a bound state of more fundamental constituents in the same way that the pion 
is really a bound state of quarks. This mechanism works beautifully when applied to the 
generation of gauge boson masses, but fails spectacularly (and requires epicyclic patches!) 
when one attempts to describe fermion masses. The basic idea of technicolour is that 
there is no hierarchy problem because there is no hierarchy; a wealth of TeV-scale new 
physics awaits to be discovered in the simplest version (applied to gauge bosons) of the 
theory. 


A further, even more radical notion, is that both Higgs bosons and fermions are 
composite. This scheme so far has run aground on the difficulty of constructing quarks 
and leptons, keeping their masses light by nearly preserving a chiral symmetry (`t Hooft 
1980). One can make guesses as to quantum numbers of constituents (Rosner and Soper 
1992), but a sensible dynamics remains completely elusive. 


5.5 Fermion masses 


We finessed the question of the origin of the Cabibbo-Kobayashi-Maskawa (CKM) matrix. 
It comes about in the following way. 


The electroweak Lagrangian, before electroweak symmetry breaking. may be written 
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in flavour-diagonal form as 


£ LY'WCO D', +he], (199) 


I m; 
in = — —[U* 
' Vil 


where U’ = (u',c',t') and D' = (d',s', b) are column vectors describing weak eigenstates. 
Here g is the weak SU(2); coupling constant, and V; = (1 — 35)v/2 is the left-handed 
projection of the fermion field 2 U or D. 


Quark mixing arise because mass terms in the Lagrangian are permitted to connect 
weak eigenstates with one another. Thus, the matrices My, p in 


Lm = —(U'pMyU', + D'aMpD'r t hc] (200) 


may contain off-diagonal terms. One may diagonalise these matrices by separate unitary 
transformations on left-handed and right-handed quark fields: 


RóMqgLo = Lo6M Ra = AQ. (201) 
where 
Q',-LQQu | Qag-ReoQa = (Q=U.D). (202) 
Using the relation between weak eigenstates and mass eigenstates: U'; = LyU, and 
D', = LpDy,, we find 
Lim = — [Ur W, V Di + hc], (203) 


/2 
where U = (u,c,t) and D = (d,s,b) are the mass eigenstates, and V = LiLp. The 
matrix V is just the Cabibbo-Kobayashi-Maskawa matrix. By construction, it is unitary: 
V*V = VV* = 1. It carries no information about Ry or Rp. More information would be 
forthcoming from interactions sensitive to right-handed quarks or from a genuine theory 
of quark masses. 


Quark mass matrices can yield the observed hierarchy in CKM matrix elements. As 
an example (Rosenfeld and Rosner 2001), the regularities of quark masses evolved to a 
common high mass scale can be reproduced by the choice 


0 ee? 0 
Mg = m| ee? e e], (204) 
0 e 1 


where m3 denotes the mass eigenvalue of the third-family quark (t or b), and e œ 0.07 for 
u quarks, c 0.21 for d quarks. Hierarchical descriptions of this type were first introduced 
by Froggatt and Nielsen (1979). The present ansatz is closely related to one described by 
Fritzsch and Xing (1995). This type of mass matrix leads to acceptable values and phases 
of CKM elements. 


The question of neutrino masses and mixings has entered a whole new phase with 
spectacular results from neutrino observatories such as super-Kamiokande (*"Super-K") 
in Japan and the Sudbury Neutrino Observatory (SNO) in Canada. These indicate that: 


1. Atmospheric muon neutrinos oscillate in vacuum, probably to r neutrinos, with 
near-maximal mixing and a difference in squared mass Am? ~ 3 x 107? eV?. 
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2. Solar electron neutrinos oscillate, most likely in matter, to some combination of 
muon and 7 neutrinos. All possible Am? values are at most about 1074 eV?; several 
ranges of parameters are permitted, with large mixing favoured by some analyses. 


In addition one experiment, the Liquid Scintillator Neutrino Detector (LSND) at Los 
Alamos National Laboratory, suggests v, — v, oscillations with Am? ~ 0.1 to 1 eV, 
with small mixing. This possibility is difficult to reconcile with the previous two, and a 
forthcoming experiment at Fermilab (Mini-BooNE) is scheduled to check the result. For 
late news on neutrinos see the Web page maintained by Goodman (2001). 


A possible explanation of small neutrino masses (Gell-Mann, Ramond, and Slansky 
1979, Yanagida 1979) is that they are Majorana masses of order my = m?,/My, where 
mp is a typical Dirac mass and My is a large Majorana mass acquired by right-handed 
neutrinos. Such a mass term is invariant under SU (2);, and hence is completely acceptable 
in the electroweak theory. The pattern of neutrino Majorana and Dirac masses, and the 
mixing pattern, is likely to provide us with fascinating clues over the coming years as to 
the fundamental origin and nature of mass. 


6 Summary 


The Standard Model of electroweak and strong interactions has been in place for nearly 
thirty years, but precise tests have entered a phase that permits glimpses of physics 
beyond this impressive structure, most likely associated with the yet-to-be discovered 
Higgs boson. Studies of mixing between neutral kaons or neutral B mesons, covered by 
Stone (2001) in these lectures, are attaining impressive accuracy as well, and could yield 
cracks in the Standard Model at any time. It is time to ask what lies behind the pattern 
of fermion masses and mixings. This is an input to the Standard Model, characterised by 
many free parameters all of which await explanation. 
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1 Introduction 


The dedicated study of b-flavoured hadrons has developed into one of the most active 
and promising areas of experimental high-energy physics. The detailed investigation of b 
decays at the B-meson factories and at hadron colliders will probe the flavour physics of 
quarks with unprecedented precision. To fully exploit this rich source of data a systematic 
theoretical approach is necessary. The required field theoretical tools are the subject of 
these lectures. 


We shall discuss the construction of effective weak Hamiltonians, introducing the op- 
erator product expansion (OPE) and the renormalisation group (RG) and presenting the 
effective Hamiltonians for nonleptonic AB = 1 and AB = 2 transitions as examples. The 
subsequent chapter is devoted to heavy-quark effective theory (HQET). It explains the 
basic formalism as well as the application to heavy-light and heavy-heavy currents, dis- 
cussing the B-meson decay constant fg and the exclusive semileptonic decay B — D* Iv, 
respectively. Inclusive b decays and the heavy-quark expansion (HQE) are treated next, in 
particular the general formalism, the issue of quark-hadron duality, the theory of b-hadron 
lifetimes and of inclusive semileptonic decays. Finally, we discuss QCD factorisation for 
exclusive hadronic B decays, focusing on B —^ Dr and B > am. We conclude with a 
short summary. 


The effective-Hamiltonian framework is the oldest and most general of the methods 
we shall discuss. It dates back, more or less, to the beginnings of the standard model 
itself. HQET and HQE are later developments that have been established in the second 
half of the eighties and at the beginning of the nineties. The last topic, QCD factorisation 
for exclusive B decays is the most recent. It is the least well established among these 
methods and it continues to be studied and developed in further detail. 
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We would like to mention a very short selection of literature, which we hope will 
be helpful to obtain further information on the various subjects related to the contents 
of these lectures. Very useful resources are the BaBar Physics Book (Harrison & Quinn 
1998) and the Fermilab B Physics Report (Anikeev et al. 2002). They collect nice reviews 
on both theoretical and experimental topics in B physics. A textbook more specifically 
directed towards theoretical heavy quark physics is the work of Manohar & Wise (2000). 
Review articles on particular subjects are (Buchalla, Buras & Lautenbacher 1996) on 
effective weak Hamiltonians, (Neubert 1994) on HQET and (Bigi, Shifman and Uraltsev 
1997) on HQE. 


2 Introduction and overview 


2.1 Motivation 


In the following chapters we will study the theoretical tools to compute weak decay 
properties of heavy hadrons. To put the formalism into perspective, we start by recalling 
the main motivation for this subject. 


The central goal is the investigation of flavour physics, the most complicated sector 
in our understanding of fundamental interactions. A good example is give by particle- 
antiparticle mixing, as first studied with neutral kaons. The strong interaction eigenstates 
K? (sd) and K? (ds) can be transformed into each other through second order weak 
interactions, which leads to a tiny off-diagonal entry Mi» in the mass matrix (Figure 1). 


d uc 8 
K+ Ko - M Ma| K 
Kis K? W g? M M i? 
S U.C d 


Figure 1. K? - K? mizing. 
The physical eigenstates are Kz, s = (K°+ K?)/ V2 with a mass difference Amy 2m, -ms 
given by 2| Mi»| so that 


Amy ~ G2.f? By 
Mk ~ 672 


[V Val m2 = 7-107), (1) 


where the number on the right-hand side is the experimental value. The theoretical 
expression is derived neglecting the third generation of quarks and CP violation, which 
is a valid approximation for Amg. The factors f2 By (Bx z 1 for the purpose of a 
first estimate) account for the binding of the quarks into mesons. A crucial feature of 
(1) is the Glashow-Iliopoulos-Maiani (GIM) cancellation mechanism: the orthogonality 
of the quark mixing matrix V;; (i=u, c; j—d, s) leads to a cancellation among the various 
contributions with intermediate up and charm quarks (symbolically the amplitude has the 
form (uu) — (uc) — (cu) + (cc)). For m, = m, this cancellation would be complete, giving 
Amx = 0. Still, even for m, Æ Me, the contributions from virtual momenta k >> Me, my 
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cancel since both m, and m, are negligible in this case. What is left is a characteristic 
effect proportional to m2, the up-quark contribution being subleading for m, < Aocp « 
m,. This circumstance allowed Gaillard and Lee in 1974 to correctly estimate the charm- 
quark mass m, zz 1.5 GeV, before charm was eventually discovered in the Fall of the same 
year. 


In a similar way, the discovery of Ba - Ba mixing by the ARGUS collaboration (Al- 
brecht et al. 1987) proved to be another milestone in flavour physics. In full analogy to 
K - K mixing we have 


Amp | GrfpBp 2 452 me —14 
mp vi 672 IVe Vial MyS M2, = 6 - 10 , (2) 


where now the top-quark contribution is completely dominant. The unexpectedly large 
value observed by ARGUS provided the first evidence that the top-quark mass (Mi pole = 
176 GeV) was comparable to the weak scale and in any case much heavier than anticipated 
at the time. 


These examples show very nicely the “flavour” of flavour physics: precision observables 
are sensitive probes of high-energy scales and yield crucial insights into the fundamental 
structure of weak interactions. At the same time we see that hadronic effects manifest in 
quantities such as fg, Bg, and strong interactions of the participating quarks in general, 
play an important role. Their understanding is necessary to reveal the underlying flavour 
dynamics and is the main subject of heavy quark theory. 

B - B mixing, CP violation in B decays and other loop-induced reactions of b-flavoured 
hadrons are of great interest and continue to be pursued by numerous experiments. A 
central target is the Cabibbo-Kobayashi-Maskawa (CKM) matrix 


2 
Vaa Vas Va 1-4 A AX(g — in) 
2 
V=| Va Ves Veo | = -A 1-h AY (3) 
Via Vis Veo AX(1 -o~ in) —AX 1 


which parametrises charged-current weak interactions in the standard model. The second 
equality in (3) introduces the approximate form of the Wolfenstein parametrisation, where 
the four independent CKM quantities are A, A, o and 7. The unitarity triangle, as defined 
in terms of o and 7 is shown in Figure 2, indicating the CP violating angles a, 3 and y. 


(0,7) 


Q 


Y B 
(0,0) (1,0) 


Figure 2. Definition of the unitarity triangle. 
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TI en 


Lid Soe 


Figure 3. Global fit of the unitarity triangle (darker shaded region), without sin 28 from 
CP violation in B > J/WKs. The constraint from the world average sin 28. (hatched 
areas) is overlaid (Hócker et al. 2001). 


The status of the unitarity triangle (in terms of p = o(1— 12/2) and 5 = {1 — 1?/2)) is 
displayed in Figure 3, with input from CP violation in the kaon sector (£x) and from B 
decays (|Vus/ Væl, AMB, Amp, and sin 28). 


2.2 B decays — overview 


A vast number of different B-decay observables is available to further test the standard 
scenario of flavour dynamics. One may distinguish the following broad classes. 


e Dominant decays: 
b— ctd, b> cts, bo clv 
BoDr Bowk, B> Dw 
» Rare decays: 
b — uud, b uüs, b ul? 
Boar, Bork, B-mlp, Bolo 
e Rare and radiative (loop induced) decays: 
b> s(d)y, b> s(di'l, bo s(d)vv 
Bo KC), Bop... 


« AB = 2 oscillations: z 
Ba- Ba, B,- B, mixing 
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Other obvious classifications are between inclusive and exclusive processes or between 
hadronic and (semi)leptonic decays. 


A few key properties of b-hadrons enhance considerably the possibilities in B physics 
both experimentally and theoretically. First, the smallness of V = 0.04 leads to a long 
lifetime of tg ~ 1.6 ps. In addition the b-quark mass is large compared with the QCD 
scale 


m, > Aqcp ~ 0.3GeV. (4) 


The exact value of m, depends on the definition. In particular, the running MS mass 
Tha (Th5) = 4.2+0.1GeV, the pole mass msg, zz 4.8GeV, whereas the mass of the lightest b- 
hadron is mp = 5.28GeV. The smallness of Aocp/m, provides us with a useful expansion 
parameter. Together with the property of asymptotic freedom of QCD and a,(m,) « 1, 
this opens the possibility of systematic approximations, which are exploited in the various 
applications of heavy quark theory. 


3 Effective weak Hamiltonians 


Computing weak decays of hadrons is a complicated problem in quantum field theory. 
Two typical cases, the first-order nonleptonic process B? — +a~, and the loop-induced, 
second-order weak transition B^ — Kv are illustrated in Figure 4. The dynamics 


Figure 4. QCD effects in weak decays. 


of the decays is determined by a nontrivial interplay of strong and electroweak forces, 
which is characterised by several energy scales of very different magnitude, the W mass, 
the various quark masses and the QCD scale: m,, Mw > m, m > Agcp > My, 
ma, (ms). While it is usually sufficient to treat electroweak interactions to lowest non- 
vanishing order in perturbation theory, it is necessary to consider all orders in QCD. 
Asymptotic freedom still allows us to compute the effect of strong interactions at short. 
distances perturbatively. However, since the participating hadrons are bound states with 
light quarks, confined inside the hadron by long-distance dynamics, it is clear that also 
non-perturbative QCD interactions enter the decay process in an essential way. 


To deal with this situation, we need a method to disentangle long- and short-distance 
contributions to the decay amplitude in a systematic fashion. A basic tool for this purpose 
is provided by the operator product expansion (OPE). 
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3.1 Operator product expansion 


We will now discuss the basic concepts of the OPE for B meson decay amplitudes. These 
concepts are of crucial importance for the theory of weak decay processes, not only in 
the case of B mesons, but also for kaons, mesons with charm, light or heavy baryons and 
weakly decaying hadrons in general. Consider, for instance, the basic W-boson exchange 
process shown on the left-hand side of Figure 5. This diagram mediates the decay of 


d u d u 
EN LIC D». 


Figure 5. OPE for weak decays. 


a b quark and triggers the nonleptonic decay of a B meson such as B? — c*«-. The 
quark-level transition shown is understood to be dressed with QCD interactions of all 
kinds, including the binding of the quarks into the mesons. To simplify this problem, we 
may look for a suitable expansion parameter, as we are used to do in theoretical physics. 
Here, a key feature is provided by the fact that the W mass My is very much heavier 
than the other momentum scales p in the problem (ms, Aocp, Mu, Ma, Ms). We can 
therefore expand the full amplitude A, schematically, as follows 


A- c (s. a.) -(@) +0 (fr). (5) 


which is sketched in Figure 5. Up to negligible power corrections of O(p’/Mj,), the full 
amplitude on the left-hand side is written as the matrix element of a local four-quark 
operator Q, multiplied by a Wilson coefficient C. This expansion in 1/My is called a 
(short-distance) operator product expansion because the nonlocal product of two bilinear 
quark-current operators (du) and (ub) that interact via W exchange, is expanded into a 
series of local operators. Physically, the expansion in Figure 5 means that the exchange 
of the very heavy W boson can be approximated by a point-like four-quark interaction. 
With this picture the formal terminology of the OPE can be expressed in a more intuitive 
language by interpreting the local four-quark operator as a four-quark interaction vertex 
and the Wilson coefficient as the corresponding coupling constant. Together they define 
an effective Hamiltonian Her = C-Q, describing weak interactions of light quarks at low 
energies. [gnoring QCD, the OPE reads explicitly (in momentum space) 


A= gi swap eB (du)y.. A (üb)y A 
- 
= um VoaVaC- (Q) +0 , (6) 
with C =1, Q= (du)y-a(ūb)v-4 and 
Gp . 
Her = vg vaa du) v a (ub)v.— (7) 


As we will demonstrate in more detail below after including QCD effects, the most 
important property of the OPE in (5) is the factorisation of long- and short-distance 
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contributions: all effects of QCD interactions above some factorisation scale „u (short 
distances) are contained in the Wilson coefficient C. All the low-energy contributions 
below 4 (long distances) are collected into the matrix elements of local operators (Q). 
In this way the short-distance part of the amplitude can be systematically extracted and 
calculated in perturbation theory. The problem to evaluate the matrix elements of local 
operators between hadron states remains. This task requires in general nonperturbative 
techniques, as for example lattice QCD or QCD sum rules, but it is considerably simpler 
than the original problem of the full standard-model amplitude. In some cases also the 
approximate flavour symmetries of QCD (isospin, SU(3)) can help to determine the non- 
perturbative input. This is true in general for hadronic weak decays. A decisive advantage 
of heavy hadrons is the fact that the heavy-quark mass itself is still large in comparison 
to Agcp. The limit Agcp/mu « 1 can then be exploited, which is achieved, depending 
on the application, by using heavy quark effective theory, heavy quark expansion or QCD 
factorisation for exclusive nonleptonic decays. 


The short-distance OPE that we have described, the resulting effective Hamiltonian, 
and the factorisation property are fundamental for the theory of B decays. However, the 
concept of factorisation of long- and short-distance contributions reaches far beyond these 
applications. In fact, the idea of factorisation, in various forms and generalisations, is the 
key to essentially all applications of perturbative QCD, including the important areas of 
deep-inelastic scattering and jet or lepton pair production in hadron-hadron collisions. 
The reason is the same in all cases: Perturbative QCD is a theory of quarks and gluons, 
but those never appear in isolation and are always bound inside hadrons. Nonperturbative 
dynamics is therefore always relevant to some extent in hadronic reactions, even if these 
occur at very high energy or with a large intrinsic mass scale. Thus, before perturbation 
theory can be applied, nonperturbative input has to be isolated in a systematic way, and 
this is achieved by establishing the property of factorisation. It turns out that the weak 
effective Hamiltonian for nonleptonic B decays provides a nice example to demonstrate 
the general idea of factorisation in simple and explicit terms. 


We would therefore like to discuss the OPE for B decays in more detail, including the 
effects of QCD, and illustrate the calculation of the Wilson coefficients. A diagrammatic 
representation for the OPE is shown in Figure 6. The key to calculating the coefficients 


PX e - € (SQ Re) 


Figure 6. Caiculation of Wilson coefficients of the OPE. 


Cj is again the property of factorisation. Since factorisation implies the separation of all 
long-distance sensitive features of the amplitude into the matrix elements of (Q;), the 
short-distance quantities C; are, in particular, independent of the external states. This 
means that the C; are always the same, no matter whether we consider the actual physical 
amplitude where the quarks are bound inside mesons, or any other, unphysical amplitude 
with on-shell or even off-shell external quark lines. Thus, even though we are ultimately 
interested in, e.g., B — m3 amplitudes, for the perturbative evaluation of C; we are free to 
choose any treatment of the external quarks according to our calculational convenience. 
A convenient choice that we will use below is to take all external quarks massless and 
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with the same off-shell momentum p (p? # 0). The computation of the C; in perturbation 
theory then proceeds in the following steps: 


e Compute the amplitude A in the full theory (with W propagator) for arbitrary 
external states. 


» Compute the matrix elements (Q;) with the same treatment of external states. 


e Extract the C; from A = C; (Qi). 


We remark that with the off-shell momenta p for the quark lines the amplitude is even 
gauge dependent and clearly unphysical. However, this dependence is identical for A and 
(Q;) and drops out in the coefficients. The actual calculation is most easily performed in 
Feynman gauge. To O(a,) there are four relevant diagrams, the one shown in Figure 6 
together with the remaining three possibilities to connect the two quark lines with a gluon. 
Gluon corrections to either of these quark currents need not be considered, they are the 
same on both sides of the OPE and drop out in the C;. The operators that appear on 
the right-hand side follow from the actual calculations. Without QCD corrections there 
is only one operator of dimension 6 


Qi = (diui)v A (üjbj) v 1, (8) 


where the colour indices have been made explicit. To O(a,) QCD generates another 
operator E 

Qe = (diuj)v. A(ü;bi) v A, (9) 
which has the same Dirac and flavour structure, but a different colour form. Its origin is 
illustrated in Figure 7, where we recall the useful identity for SU(N) Gell-Mann matrices 


(d;Tiux) (tj; Th) = -gy (dius) (a) + 5 (deus) (a,b). (10) 


i Uk 


u L 
a J 
T; l 
Figure 7. QCD correction with colour assignment. 


It is convenient to employ a different operator basis, defining 


Qa = LEO, (11) 


The corresponding coefficients are then given by 


C, - Ci O. (12) 
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If we denote by S4 the spinor expressions that correspond to the operators Q+ (in other 
words: the tree-level matrix elements of Q.), the full amplitude can be written as 


2 2 
A= ( + 7,0, ln A S+ ( + y_a, In m ) $.. (13) 


Here we have focused on the logarithmic terms and dropped a constant contribution (of 
order o5, but nonlogarithmic). Further, p? is the virtuality of the quarks and y4 are 
numbers that we will specify later on. We next compute the matrix elements of the 
operators in the effective theory, using the same approximations, and find 


(Q+) = (+140 (: +In +) Sx. (14) 


The divergence that appears in this case has been regulated in dimensional regularisation 
(D = 4 — 2€ dimensions). Requiring 


A = C4(Q4) + C-(Q-), (15) 


we obtain 2 


M, 

Cz = 1 + y+, In "E (16) 

where the divergence has been subtracted in the minimal subtraction scheme. The effec- 
tive Hamiltonian we have been o for then reads 


Her = EVÄ Va (CAQ: * C-(H)Q-) (a7) 


Sey, 
with the coefficients C+ determined in (16) to O(o,log) in perturbation theory. The 
following points are worth noting: 


e The 1/e (ultraviolet) divergence in the effective theory (14) reflects the Mw — oo 
limit. This can be seen from the amplitude in the full theory (13), which is finite, but 
develops a logarithmic singularity in this limit. Consequently, the renormalisation 
in the effective theory is directly linked to the In My dependence of the decay 
amplitude. 


e We observe that although A and (Q4) both depend on the long-distance properties 
of the external states (through p?), this dependence has dropped out in C4. Here we 
see explicitly how factorisation is realised. Technically, to O(a, log), factorisation 
is equivalent to splitting the logarithm of the full amplitude according to 


My Mw p? 
ne oe tee (18) 


Ultimately the logarithms stem from loop momentum integrations and the range of 
large momenta, between My and the factorisation scale p, is indeed separated into 
the Wilson coefficients. 


e To obtain a decay amplitude from Heg in (17), the matrix elements (f|Q.|B)(u) 
have to be taken, normalised at a scale u. An appropriate value for p is close to 
the b-quark mass scale in order not to introduce an unnaturally large scale into the 
calculation of (Q). 
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e The factorisation scale u is unphysical. It cancels between Wilson coefficient and 
hadronic matrix element, to a given order in o, to yield a scale independent decay 
amplitude. The mechanism of this cancellation to O(a,) is clear from the above 
example (13-16). 


» In the construction of Her the W-boson is said to be “integrated out”, that is, 
removed from the effective theory as an explicit degree of freedom. Its effect is still 
implicitly contained in the Wilson coefficients. The extraction of these coeffcients 
is often called a “matching calculation", matching the full to the effective theory by 
“adjusting” the couplings C+. 


» If we go beyond the leading logarithmic approximation O(a, log) and include the 
finite corrections of O(a,) in (13), (14), an ambiguity arises when renormalising 
the divergence in (14) (or, equivalently, in the Wilson coefficients C4). This ambi- 
guity consists in what part of the full (non-logarithmic) O(a,) term is attributed 
to the matrix elements, and what part to the Wilson coefficients. In other words, 
coefficients and matrix elements become scheme dependent, that is, dependent on 
the renormalisation scheme, beyond the leading logarithmic approximation. The 
scheme dependence is unphysical and cancels in the product of coefficients and ma- 
trix elements. Of course, both quantities have to be evaluated in the same scheme to 
obtain a consistent result. The renormalisation scheme is determined in particular 
by the subtraction constants (minimal or non- minimal subtraction of 1/e poles), and 
also by the definition of ys used in D # 4 dimensions in the context of dimensional 
regularisation. 


» Finally, the effective Hamiltonian (17) can be considered as a modern version of the 
old Fermi theory for weak interactions. It is a systematic low-energy approximation 
to the standard model for b-hadron decays and provides the basis for any further 
analysis. 


3.2 Renormalisation group 


Let us have a closer look at the Wilson coefficients, which read explicitly 


(0) 2 
u os (Hu) Y+ H (0 _ 4 
C=1+ 4r 2 In MZ Yi = PE (19) 


where we have now specified the exact form of the O(a,log) correction. Numerically 
the factor a,(ms) yP /(87) is about +3.5% (—7%), a reasonable size for a perturbative 
correction (we used a,(u = 4.2 GeV) = 0.22). However, this term comes with a large 
logarithmic factor of In(g?/Mg,) = —6, for an appropriate scale of u = 4.2 GeV. The 
total correction to C4 — 1 in (19) is then —2196 (4296)! The presence of the large 
logarithm spoils the validity of a straightforward perturbative expansion, despite the fact 
that the coupling constant itself is still reasonably small. This situation is quite common in 
renormalisable quantum field theories. Logarithms appear naturally and can become very 
large when the problem involves very different scales. The general situation is indicated 
in the following table, where we display the form of the correction terms in higher orders, 
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denoting £ z In(u/ Mw): 


LL NLL 
ase Qs 
o2? al æ 
ab o3? œl o (20) 
4 4 
O(1) Olas) 


In ordinary perturbation theory the expansion is organised according to powers of a, 
alone, corresponding to the rows in the above scheme. This approach is invalidated by 
the large logarithms since o,£, in contrast to as, is no longer a small parameter, but 
a quantity of order 1. The problem can be resolved by re-summing the terms (o;,£)" 
to all orders n. The expansion is then reorganised in terms of columns of the above 
table. The first column is of O(1) and yields the leading logarithmic approximation, 
the second column gives a correction of relative order as, and so forth. Technically the 
reorganisation is achieved by solving the renormalisation group equation (RGE) for the 
Wilson coefficients. The RGE is a differential equation describing the change of C4 (x) 
under a change of scale. To leading order this equation can be read off from (19) 
d Os 


Cx(u) = 1239 Cal). (21) 


dln pt 


The factors (o, Az) are called the anomalous dimensions of C+. To understand the 
term “dimension”, compare with the following relation for the quantity u”, which has 
(energy) dimension n: 

Th uM cmm (22) 
The analogy is obvious. Of course, the C.(j) are dimensionless numbers in the usual 
sense; they can depend on the energy scale p only because there is another scale, My, 
present under the logarithm in (19). Their “dimension” is therefore more precisely called 
a scaling dimension, measuring the rate of change of C4 with a changing scale pu. The 
nontrivial scaling dimension derives from O(a,) loop corrections and is thus a genuine 
quantum effect. Classically the coefficients are scale invariant, C4 = 1. Whenever a 
symmetry that holds at the classical level is broken by quantum effects, we speak of an 
“anomaly”. Hence, the 40 represent the anomalous (scaling) dimensions of the Wilson 
coefficients. 


We can solve (21), using 


das o? 33 — 2f 7 
Ima -2o Bo = —3 ^ Cy(My) = 1, (23) 
and find 
(0) (0) 
26o 2 17 Y+ /26o 
_ as(Mw) Ye / E os(u),. Mw 
e| creme ed 


This is the solution for the Wilson coefficients C4 in leading logarithmic approximation, 
that is to leading order in RG improved perturbation theory. The all-orders re-summation 
of a, log terms is apparent in the final expression in (24). 
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3.3 The AB = 1 effective Hamiltonian 


In this section we will complete the discussion of the AB — 1 effective Hamiltonian. So 
far we have considered the operators 


QU = (dip)v-A(Djbj)v-A, (25) 


Qb = (dipi)v-A(Djbi)v- A; (26) 


which come from the simple W-exchange graph and the corresponding QCD corrections 
(Figure 8). We have slightly generalised our previous notation, allowing for the cases 


p-u,c. 
W 
u.C 
d , VÀ, c.t 


gb 
b uU, C P g 


Figure 8. Correction to W exchange. Figure 9. QCD-penguin diagram. 


In addition, there is a further type of diagram at O(a,), which we have omitted until 
now: the QCD-penguin diagram shown in Figure 9. It gives rise to the four new operators 


Qs = (dib)v-AY (djqi)v-A. (27) 
Qi = (d) A (Bava (28) 
Qs = (dva Y ava (29) 
Q = bi Yale (30) 


Two structures appear when the light-quark current (gg)y from the bottom end of the 
diagram is split into V — A and V + A parts. In turn, each of those comes in two colour 
forms in a way similar to Q, and Qe. Finally, one further gauge-invariant operator of 
dimension six appears in the matching procedure, the chromomagnetic penguin operator 


qu gimdo WMC an 


This operator corresponds to the diagrams in Figure 9 with the lower quark line omit- 
ted. The gluon is thus an external field, represented in (31) by the field-strength tensor 
Gw- Note that the characteristic tensor current necessitates a helicity flip in the b > d 
transition, which is accompanied by a factor of the quark mass m, (the effect of mg is 
neglected). The contribution of Qs; would be very small for the Hamiltonian of K decays, 
which only involves light external quarks, but it is unsuppressed for b decays. 


The operators Qi,..., Qe, Qs, mix under renormalisation, that is the RGE for their 
Wilson coefficients is governed by a matrix of anomalous dimensions, generalising (21). 
In this way the RG evolution of C; ? affects the evolution of C3,. .. , Cs, Cgg- On the other 
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hand C2 remain unchanged in the presence of the penguin operators Qs, . .., Qe, Qeg, so 
that the results for C1; derived above are still valid. 


The construction of the effective Hamiltonian follows the principles we have discussed 
in the previous sections. First the Wilson coefficients C;(uw), i = 1,...,6,8g, are de- 
termined at a large scale uw = O(Mw,mi) to a given order in perturbation theory. 
In this step both the W boson and the heavy top quark are integrated out. Since the 
renormalisation scale is chosen to be uw = O(Myw, my), no large logarithms appear and 
straightforward perturbation theory can be used for the matching calculation. The anoma- 
lous dimensions are computed from the divergent parts of the operator matrix elements, 
which correspond to the UV-renormalisation of the Wilson coefficients. Solving the RGE 
the C; are evolved from py to a scale u = O(m,) in a theory with f = 5 active flavours 
q = u,d,s,c,b. The terms taken into account in the RG improved perturbative evaluation 
of C;(u) are, schematically: 


LO: (o; In Mw)”, NLO: as (o; In Muy, 


at leading and next-to-leading order, respectively. 


The final result for the AB = 1 effective Hamiltonian can be written as 


Ha = Gr Ap |CiQF + O2Q5+ SY Q| + he, (32) 
2 p-u,c 1—3,...,6,8g 
where A, = Va Vpb- In principle there are three different CKM factors, A,, Àc and A,, cor- 
responding to the different flavours of up-type quarks that can participate in the charged- 
current weak interaction. Using CKM unitarity, one of them can be eliminated. If we 
eliminate À;, we arrive at the CKM structure of (32). 


The Hamiltonian in (32) is the basis for computing nonleptonic 6-hadron decays within 
the standard model (to lowest order in electroweak interactions) with AB — 1 and AS, 
AC = 0. The Hamiltonian for b-decays with different flavour quantum numbers of the 
light quarks has a completely analogous form. For instance, AB = 1 transitions with 
a simultaneous change in strangeness, AS = 1, are simply described by (32) after the 
replacement d — s. When new physics is present at some higher energy scale, the 
effective Hamiltonian can be derived in an analogous way. The matching calculation 
at the high scale uw will give new contributions to the coefficients C;(uw), the initial 
conditions for the RG evolution. In general, new operators may also be induced. The 
Wilson coefficients C; are known in the standard model at NLO. A more detailed account 
of 4(AP-! and information on the technical aspects of the necessary calculations can be 
found in (Buchalla, Buras & Lautenbacher 1996) and (Buras 1998). 


3.4 B-B mixing at NLO 


In the following section we present the effective Hamiltonian for a AB = 2 transition, 
which is relevant for B — B mixing. In this case only a single operator contributes. The 
form of the Hamiltonian is therefore particularly simple. We use this example to illustrate 
the structure of Wilson coefficients at next-to-leading order. The mass difference Amg in 
the B — B system is related to the effective Hamiltonian H4P=? by 


1 = — 
Ams = 2|Mis|s = — K(B[RAP7| B). (33) 
mpg 
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In order to construct H4 =, the full standard model amplitude for AB = 2 transitions is 
matched onto the effective theory amplitude at the matching scale uj = O(m) = O( My). 
This is sketched in Figure 10. 


b u,c,t d b d 

we br + QCD ~ Clu): >t 

d b d b 
u,c,t 


Figure 10. OPE for B - B mizing. 


There is only one local operator 


Q = (bd)v .A(bd)v 4. (34) 


The Wilson coefficient, up to next-to-leading order, can be written as 


a 

C(u) = C (u) + TC. (uo: (35) 
where C is the lowest order result and C") comes from the corrections with one-gluon 
exchange. The RG evolution from the high scale jj, down to a scale p = O(mj) has the 


form 6/28 


The second factor on the right-hand side is familiar from the leading logarithmic approx- 
imation (the only difference is that at NLO the two-loop expression for o,(u) has to be 
used). The first factor represents the next-to-leading order correction. Here J5 is a scheme- 
dependent constant, which in the usual, so-called NDR, scheme reads J; = 5165/3174. 


We now have the ingredients to write the effective Hamiltonian up to NLO precision 


_ GLM) as 
Het = as (uva): Clu) Q (37) 
G?. Mj s 
= Ces (Vie Va) So(0)na[os(2)] °° pea Q. 


1672 T 


The result is entirely dominated by the top-quark contribution. It is common practice to 
separate the coefficient C(x) into the function So(z,) (with z; = m?/Mj,), which would 
be the coefficient in the absence of QCD effects, into the terms that depend on o (4), and 
the remainder, which is defined as the QCD-correction factor ng. This has been done in 
the second equation in (37). Taking the matrix element of H4-? between the B and the 
B state and using (33) gives 


GM, 


6n2 IV Vial’ So(2e)ne Ba fgme. (38) 


Amp = 


One encounters the hadronic matrix element of Q, which is written as 


(BIQIB)(u) = 5 flm Bolu), (39) 
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defining the (scale and scheme dependent) hadronic parameter Bg(:). The combination 


Bs = Bs(n)|os(u)] 9 [ * e| (40) 


is formally scale and scheme independent and has been used in (38). The parameter Bp(y:) 
is a nonperturbative quantity and has to be determined e.g. by lattice calculations. At 
present the value of Bg is still very uncertain, in contrast to the short-distance QCD 
corrections, which are precisely known. A numerical illustration is given in Table 1, 
where we have put Bg(p) = 0.9 as an example. 


Table. | 0.846 | o9 | =0.761 | 


ne — [o.()] 97? [1+ ete «| Ba(u) 


0.551 [ 1.38 = 0.761 


Two different definitions of a short-distance QCD factor can be considered, depending 
on where the terms with a,(j) are included. One possibility is to include them with 
ne into a Wilson coefficient (=0.846), which is to be multiplied by the hadronic matrix 
element Bg(q) = 0.9. The other possibility is the formally scheme independent separation 
into ng = 0.551 and Bg = 1.38 (for ng this is the precise result; Bg = 1.38 is only true 
in our example). The purpose of this exercise is to remind us that different definitions 
are sometimes employed for the parameter Bg and care has to be taken which one is 
being used, in order to combine it with the appropriate short-distance corrections. We 
can also see that the large deviation of the QCD correction factor 7g from 1 is merely a 
consequence of pulling out the large factor [o(1)] 9/25. It is somewhat artificial and does 
certainly not indicate a problem for perturbation theory. In fact, the coefficient 0.846 is 
the one that has the proper limit, approaching 1 as a, — 0. It is indeed much closer to 
unity in accordance with the expectation for a perturbative correction factor. Still, the 
use of 7g = 0.551 is often adopted due to its formally scheme invariant definition. 

An important application is the ratio of the mass differences for Bg and B, mesons, 
for which (38) implies 
? mg, fha Bg, 

me, fo, Be, 


Ame, Via 


Vis 
This quantity is a very useful measure of |Viq/Vis|. All other short-distance physics (top- 
dependence, ng) has dropped out. Hadronic uncertainties are reduced in the ratio of 
matrix elements, which is 1 in the limit of unbroken SU(3) flavour symmetry. The can- 
cellation of the short-distance contribution is a direct consequence of the factorisation 
property of the OPE. Lattice calculations give for the ratio of matrix elements (Hócker 
et al. 2001, and references therein) 


favVPs. L 1 1640.06. (42) 


fo, J Bs, 


The ratio Amp, /Amg, is a very powerful constraint for the unitarity triangle, as can be 
seen in Figure 3. 


(41) 


Amg, 
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4 Heavy quark effective theory 


4.1 Basic formalism 


Heavy quark effective theory (HQET) is an effective field theory designed to systematically 
exploit the simplifications of QCD interactions in the heavy-quark limit for the case of 
hadrons containing a single heavy quark. The HQET Lagrangian can be derived as follows. 
We start from the usual QCD Lagrangian for a heavy-quark field V with mass m 


L=Vipv—mvv, (43) 
with the covariant derivative 

D, = 0, — igT° Aj. (44) 
The heavy-quark momentum can be decomposed as 


p=mv+k, (45) 


where v is the 4-velocity of the heavy hadron. Once mv, the large kinematical part of the 
momentum is singled out, the remaining component k is determined by soft QCD bound 
state interactions, and thus k = O(Aqcp) « m. We next decompose the quark field Y 
into 


NONE em at) (46) 
H,(z) = mer Fa), (47) 

which implies l 
U(r) = e? (h,(z) + Hy(2)). (48) 


The expressions (1 + #)/2 are projection operators. Their action represents the covariant 
generalisation of decomposing V into upper and lower components. Using the standard 
representation for y-matrices, this is evident in the rest frame where f = 7°. Note also that 
the equation of motion with respect to the large momentum components, m(y — 1)h, = 0, 
is manifest for hy. 


The exponential factor exp(imv-z) in (46), (47) removes the large-frequency part of 
the r-dependence in V(z) resulting from the large momentum mv. Consequently, the 
z-dependence of h,, Hy is only governed by the small residual momentum and derivatives 
acting on h, and H, count as O(Agcp). (Our sign conventions are appropriate for a heavy 
quark. To describe the case of a heavy anti-quark, similar definitions are valid with the 
sign of v reversed.) 

Multiplying the QCD equation of motion (i — m) = 0 with the projectors (1— $)/2 
and (1 + #)/2, and using (46) - (48) and the definition 


D" = D! — vv- D, (49) 


we obtain the coupled system of equations 


iv.Dh, = —i,H,, (50) 
(iv.D+2m)H, = iP hy. (51) 
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They represent the equation of motion in terms of h, and H,. The second equation implies 
that H, = O(Agcp/m)h, by power counting. Hence H, is suppressed with respect to hy 
in the heavy-quark limit. In other words, h, contains the large components, H, the small 
components of V. 


The HQET Lagrangian is obtained starting from (43), expressing V in terms of hy, 
H, and eliminating H, using (51). We find 
I. ic 1 . 
L= hiv Dh, + hyiD, aD gom Pih (52) 
Alternatively, H, as obtained from (51) in terms of h, can be inserted into (50) to yield 
the equation of motion for h,. This equation is just the equation of motion implied by 
(52) (upon variation with respect to hy, i.e. 6£/óh, = 0). The Lagrangian may thus be 
written down immediately given the equation of motion for the field h,. 


The second term in (52) contains the nonlocal operator (iv.D + 2m)^!. It can be 
expanded in powers of Agcp/m to yield a series of local operators. Keeping only the 
leading-power correction we can simply replace (iv-D + 2m)! by (2m)! and get 


= hiv- LRGD h + Lh or 5 
L = hyiv-Dhy zh D.) h + hyo!” Gus. (53) 


We now discuss some important aspects of this result. 


The first term on the right hand side of (53) is the basic, lowest-order Lagrangian of 
HQET. It describes the "residual" QCD dynamics of the heavy quark once the kinematic 
dependence on m is separated out. Since there is no longer any reference to the mass 
m, the only parameter to distinguish quark flavours, this term is flavour symmetric: The 
dynamics is the same for b and c quarks in the static limit. Since the operator v-D 
contains no y-matrices, which would act on the spin degrees of freedom, the leading 
HQET Lagrangian also exhibits a spin symmetry. This corresponds to the decoupling of 
the heavy-quark spin in the m — oo limit. Together, we have the famous spin-flavour 
symmetries of HQET (Isgur & Wise 1989). They lead to relations among different heavy- 


hadron form factors. 
From the Lagrangian h,iv.Dh, the Feynman rules for HQET can be read off. The 
propagator is , 
i 1+ 

— 54 

vk 2 (54) 
where the projector (1 + #)/2 appears since h, is a constrained spinor (see Equation 46). 
The interaction of the heavy-quark field h, with gluons is given by the vertex 


igv" T^. (55) 
These Feynman rules enter in the computation of QCD quantum corrections. 


The remaining terms in (53) are the leading power corrections. They have an intu- 
itive interpretation. In the first term one recognises the operator for the nonrelativistic 
kinetic energy p?/(2m), which describes the residual motion of the heavy quark recoiling 
against the light degrees of freedom inside the heavy hadron. The last term represents the 
chromomagnetic interaction of the heavy-quark spin with the gluon cloud. Both effects 
violate flavour symmetry, the chromomagnetic term also spin symmetry, but they are 
power suppressed. 
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So far we have only considered QCD interactions. Weak interactions introduce exter- 
nal currents, which can also be incorporated in HQET. A generic heavy-light transition 
current ql V, arising for instance in semileptonic decays, can be represented as 


1 
gP V - 4T h + O(—), (56) 


replacing the heavy-quark field V by the HQET field ^, using (48). 


4.2 "Theory of heavy-hadron masses 


Before considering HQET in the context of weak decays, let us discuss a first application of 
the basic HQET Lagrangian (53) in the spectroscopy of heavy hadrons. To be specific, we 
shall analyse the masses of the ground-state mesons B and B*. These mesons constitute 
a doublet that arises because the spin 1/2 of the heavy quark couples with the total spin 
1/2 of the light degrees of freedom in their ground state to form a spin-0 and a spin-1 
meson, the pseudo-scalar B and the vector B*, respectively. Because the b-quark spin 
decouples in the heavy-quark limit, the state of the light cloud is identical for B and B* 
to leading order, and the angular-momentum coupling described above is the appropriate 
scheme. If we neglect the power corrections in (53), we can immediately write down the 
composition of the meson masses 


mQ — mV) =m +A. (57) 


Evidently the meson mass has a component m, from the heavy quark. In addition it has 
a term A = O(Aqcp) from the energy of the light constituents. The latter is determined 
only by the interactions among the light degrees of freedom and their interaction with 
the static bquark (h,) through the first term in (53). It is therefore independent of my. 
The sum of m and A is a physical quantity, however, separately both parameters are 
dependent on the scheme used to define them. 


In order to include the first power corrections, we treat the 1/m terms in (53) as 
perturbations to the lowest-order HQET dynamics. To first order in perturbation theorv 
the corrections to (57) are then simply given by the expectation values of the 1/m terms. 
The proper normalisation is obtained as follows. If H = —L1/m is the Hamiltonian 
(density) corresponding to the correction term £;,, in (53), and H = f d?z 4L is the 
Hamilton operator, the mass correction due to H is just 


ómg = (Bi| HIBi), (58) 


where |B} is the B-meson state normalised to one, (B,|B,) = 1. Using the conventionally 
normalised states with (B|B) = 2mgV , we can write 


1 3 1 (B\H(0)|B) 
= B) = f &x mom = , (5 
ima = zo (BI | Pe G9IB) = zy | Pe (BIHOIB) =", (89) 
where we have used the translation invariance of H and f d?r = V. Defining 
h(iDy 1 (B|hga-Gh|B 
„ = (BIGDPMB) — y 1 Blige GhB) (60 
2mp 6 2mp 
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we obtain 
Ar + 3À2 


2m, 

Note that we may replace D? by D? in the definition of A,, up to higher order corrections 
(see (49), (50)). The parameter A, corresponds to (minus) the expectation value of the 
momentum squared of the heavy quark, 4, = —(p?) = O(AQcp). This gives a positive 
correction in (61) representing the (small) kinetic energy of the heavy-quark. The Az- 
correction to the mass reflects the interaction energy of the heavy-quark spin with its 
hadronic environment, as already discussed in the previous section. While the A;-term 
is independent of the heavy-quark spin and identical for B and B*, the chromomagnetic 
correction ~ Àz = O(A&cp) is different for B*. We have 


ómg = — (61) 


Ay — Ae 
ô = -————. 2 
M p* 2m; (62) 
Including (57) we arrive at the following expansion for the meson masses 
- Ay+3A 
mp = mytA- LA, (63) 
2Mp 
5 30-2 
= A — 
T p* ma + 2m, (64) 
where the dependence on mz, is explicit order by order. 
If we apply the heavy-quark limit to D mesons, we obtain analogous relations 
~ A, +3r 
Mp = me + A ILS, (65) 
2m, 
- Ay — X 
Mpk = Me +A- p 2 (66) 


with the same A, A, and A» as before. 


These results have a few interesting consequences. First, Ag parametrises the spin- 
splitting between the pseudo-scalar and the vector mesons: 


2A 

mpx- mg = — = 46 MeV, (67) 
b 
2A 

mpx- mp = — = 141 MeV. (68) 


HQET predicts that the spin-splitting scales inversely proportional to the heavy-quark 
mass. This is seen to be quite well fulfilled given that m; z 3m,. Relation (67) can be 
used to determine the nonperturbative quantity Àz from experiment 


1,» 


à» = - (mis — mp) = 0.12GeV?. (69) 


On the other hand, the quantity A, has to be estimated theoretically. Finally one may 
introduce the spin-averaged masses 


L Tg + 3M px l Àl 
= = A — — 
MB 1 Ma + Im,’ (70) 
+3 - À 
mp = TP ot ow +i- (71) 
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This eliminates Az and yields the useful result 


my — m, = (mg — mp) (.- AL ). (72) 


Since the A,-correction is fairly small, the quark-mass difference is rather well determined, 
much better than individual quark masses. 


Exercise 


Derive the relative factor between the chromomagnetic correction to the mass of the B 
and the B* meson. 


Solution: Denote the heavy-quark spin by s, the total spin of the light degrees of 
freedom by j and the total spin of the meson by J = s +j. The chromomagnetic field of 
the light cloud has to be proportional to j. Hence the energy of the interaction between 
this field and s is proportional to (s-j). The angular momentum algebra then gives 
(2s-j) = J(J +1) — s(s +1) — j(j +1), which is (—3/2) for B and (1/2) for B*, hence 
the relative factor (—1/3) of the A;-term in (62) with respect to (61). 


4.3 Heavy-light currents and fz 
The B-meson decay constant fg is defined by the matrix element 
(01A,|B(p)) = —ifgmgv,, (73) 
of the heavy-light axial vector current 
A, = dus. (74) 


Here q is the light-quark, V the heavy-quark field in full QCD, with V = b in the present 
case. The B-meson momentum is p — mgv. 


Let us analyse A, in HQET, including QCD corrections. The expansion of A, in 
HQET to leading order in 1/m, but allowing for QCD effects, has the form 


~ ~ 1 
A — Ci (u)À + Cu) Ap + O (=) (75) 
Åi = My Yshe Åz = Guy Ysho. (76) 


The matching conditions at the b-quark mass scale u = mẹ are 
Ci(m;) = 1 4 Olas) C»(m,) = Olas). (77) 


To leading order in QCD only A, is present in HQET, with coefficient one. Radiative 
corrections at O(a,) modify C, and generate a new operator Ag. Note that the matching 
calculation of the full-QCD current A onto HQET, leading to (75), is completely anal- 
ogous to the OPE procedure of constructing the effective weak Hamiltonian from the 
W-exchange amplitude in the full standard model, which we have discussed in Section 3. 
The difference is only that a 1/My expansion is performed in the latter case, and a 1/m, 
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expansion in the case of HQET. The basic philosophy is essentially the same. In partic- 
ular, a factorisation of long and short-distance contributions is obtained: contributions 
from large scales greater than p, including the m,-dependence, are again contained in the 
coefficient functions C5. Soft scales less than p are factorised into the hadronic matrix 
elements of ES 


In contrast to the full-QCD current A, the HQET currents do have an anomalous 
dimension, reflecting a logarithmic dependence of fg on the heavy-quark mass at O(a). 
The logarithms can be re-summed by renormalisation group methods, again in full analogy 
to the procedure in Section 3. In leading logarithmic approximation (LLA) C5 can be 
neglected and C, acquires the familiar form 


El —2/Bo 
as (Ht) 


Here the LLA assumes the hierarchy o;(my) < 1, a, In(m/p) = O(1), which holds in 
the heavy-quark limit (m, large, y = O(1 GeV)). 
To express fg in HQET via (73), (75) and (78), we need the matrix element of A; 


(0|Ai|B(p)) = —if (u) /mgv,. (79) 


Since the dynamics of HQET is independent of mj, the reduced decay constant f(y) is 
my-independent. The only my-dependence in (79) enters through a trivial factor ymse 
from the normalisation of the B-meson state, which in the usual convention is given by 


C) = | (78) 


(B|B) = 2mgV. (80) 
Collecting the ingredients, (75) yields 
PE Ju) [sew]. (81) 
Jie | os(u) 


This expression for fg is true to leading order in the HQET expansion in Aqcp/m 
and in leading logarithmic approximation in QCD. The factor f(u) in (81) is still a 
nonperturbative quantity to be determined by other methods. However, the dependence 
of fg on the heavy-quark mass is now explicit. Equation (81) implies the scaling behaviour 
fs ^ l/ymg, up to a calculable logarithmic dependence on m. In principle such a 
relation can be used to relate fg to the analogous quantity fp for heavy mesons with 
charm. In practice, it turns out that the leading order scaling result for fg is not very 
well fulfilled even for the b-mass scale and that subleading power corrections are important 
in this case. Nevertheless the result in (81) is of conceptual interest and can serve as a 
simple example of an application of HQET. 


4.4 Heavy-heavy currents: B—D*)lIp and Va 


One of the most important results of HQET is the extraction of V; from exclusive semilep- 
tonic B + D*lp decay. We will here give a short outline of the main steps in this analysis. 
The starting point is the differential decay rate 

al(B — D*lp) 


do o 7 Wal KQu) FQ) (82) 
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in the kinematical variable w = vw’, where v and v’ are the 4-velocities of B and D*, 
respectively. The dependence of (82) on |V.»|, the quantity of interest, is obvious, and K(w) 
is a known kinematical function. Finally, F(w) contains the nontrivial QCD dynamics 
encoded in the B — D* transition form factors. The corresponding matrix elements of 
the weak currents can be written in the heavy-quark limit as 


"moe D OAB) = ig(w)e(m e, v^, v), (83) 
—_(D*(v', |e yu790|B(v)) = gw) [O+ wen evo]. (80 


4 /Mp*xmMB 


In the heavy-quark limit, that is to lowest order in HQET, all hadronic dynamics is 
expressed in a single function £(w), the Isgur- Wise function (Isgur & Wise 1989). In this 
limit we further have 

F(w) = €(w). (85) 


Moreover, € is absolutely normalised at the no-recoil point 


£(1) = 1. (86) 


The no-recoil point w = 1 corresponds to the kinematical situation where the D* meson 
stays at rest in the rest frame of the decaying B (v' = v > w = 1). Measuring dT /dw at 
w = 1, |V4| can then be determined from (82) since all ingredients are known. Because 
w = 1 is at the edge of phase space, an extrapolation is necessary to find dl/dwl,-, from 
the measured spectrum. 


For a realistic analysis corrections to the heavy-quark limit need to be considered. An 
important property of B — D*lv is that linear power corrections in HQET are absent, 
dijm = 0, where m can be either m; or mẹ. Consequently the leading corrections enter 
only at second order and are thus greatly reduced. This result is known as Luke's theorem. 
The absence of linear corrections does not hold for B —> Div decays, hence the particular 
importance of B — D*lv. Including corrections, the lowest order approximation F(1) = 
£(1) = 1 is modified to 

F(1) = nall + Sijm2), (87) 


where 6 /m2 are the second order power corrections and 74 is a correction from perturbative 
QCD. To first order in a, it reads 


weit (tmm 8) E 
A detailed numerical analysis yields (Harrison & Quinn 1998) 
F(1) = 0.913 + 0.042, (89) 
which gives (Hócker et al. 2001) 
Veo = 0.0409 + 0.001 4exp + 0.0019 in. (90) 


To summarise the crucial points for the extraction of V4 from B — D*li decay: 
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e Heavy-quark symmetry relates the various semileptonic form factors (four different 
functions V, Ao, Ai, A» in full QCD) to a single quantity £(w), the Isgur- Wise 
function. 


e The function £ is absolutely normalised, £(1) = 1. This property has an intuitive 
reason: At the kinematical point w = 1 the decaying b-quark at rest is transformed 
into a c-quark, also at rest. Since both quarks are treated in the static approximation 
(my, Me — oo, m/m, fixed), the light hadronic cloud doesn't notice the flavour 
change b — c and is transfered from the B to a D meson with probability one. 
The function £ is identical for B — D and B — D* transitions, because these are 
related by heavy-quark spin symmetry. 


e HQET provides a framework for systematic corrections to the strict heavy-quark 
limit governed by £(w). Luke's theorem guarantees the absence of first-order cor- 
rections in 1/m for B + D*lp. 


4.5 HQET: conclusions 


We would finally like to summarise the basic ideas and virtues of HQET, and to re- 
emphasise the salient points. 


» HQET describes the static approximation for a heavy quark, covariantly formu- 
lated as an effective field theory and allowing for a systematic inclusion of power 
corrections. 


e Order by order in the expansion in Agcp/m HQET achieves a factorisation of hard, 
perturbative contributions (momentum scales between m and a factorisation scale y) 
and soft, nonperturbative contributions (scales below u). The former are contained 
in Wilson coefficients, the latter in the matrix elements of HQET operators. 


e The procedure of matching full QCD onto HQET is analogous to the construction 
of the effective weak Hamiltonian Her. The difference lies in the massive degrees 
of freedom that are being integrated out: the W boson (mass My) for Heg, the 
lower-component spinor field H, (mass 2m) for HQET. The perturbative matching 
can be supplemented by RG resummation of logarithms, In(My /j:) in the former 
case, In(m/p) in the latter. 


e The usefulness of HQET is based on two important features: The spin-flavour sym- 
metry of HQET relates form factors in the heavy-quark limit and thus reduces 
the number of unknown hadronic quantities. The dependence on the heavy-quark 
masses is made explicit (scaling, power corrections). 


We conclude with briefly mentioning another field, called large energy effective theory 
(LEET), which has some similarities with HQET. LEET is needed for matrix elements 


of the form (M|gT b|B) at large recoil of the light meson M = 7, p, Ke), etc. Then 
HQET is not sufficient in this situation because not only soft but also collinear infrared 
singularities need to be factorised. The latter occur due to the light-like kinematics of 
the fast and energetic light quark emitted from the weak current. To define LEET the 


80 Gerhard Buchalla 


usual heavy-quark limit can be considered for the B meson with velocity v. The large- 
energy limit is taken for the light meson M with light-like momentum vector En. Here 
E = O(m,) is the energy of M and n is a light-like 4-vector with n? = 0 and vn = 1. 
The momentum of the energetic light quark q is written as p; = En+k, with a residual 
momentum k = O(Agcp). In formal analogy to the fields h, and H, in HQET, the new 
light-quark fields 


mle) ee Efe) Qula) = et Pac) (9) 


can be defined and used in the construction of LEET (Dugan & Grinstein 1991, Charles 
et al. 1999, Beneke & Feldmann 2001, Bauer et al. 2001a). As a consequence of the 
LEET limit the ten form factors needed to describe all matrix elements (M|g T b| B) of 
bilinear heavy-light currents can be reduced to only three independent functions. LEET 
has received increasing interest quite recently and is still under active development. 


5 Inclusive decays and the heavy quark expansion 


5.1 Basic formalism and theory of lifetimes 


The heavy-quark limit, m > Agcp, proves to be extremely useful also for the computation 
of inclusive decay rates of heavy hadrons (Chay et al. 1990, Bigi et al. 1992, 1997). The 
specific technique appropriate for this application is distinct from HQET and goes by 
the name of heavy quark expansion (HQE). Consider the total decay rate Dy of a heavy 
hadron H. The starting point for the HQE is the following representation of F y 


1 
Pu = z—(HITIB) = (1), (92) 
TH 
where the transition operator 7 is defined as 
T -Im i f dz T 2, (2) Hee (0), (93) 


with Heg the effective weak Hamiltonian. Equations (92), (93) express the total decay 
rate as the absorptive part of the forward scattering amplitude H — H under the action 
of Heg. This expression is referred to as the optical theorem by analogy to a similar 
relation in optics. One may rewrite (92), (93) in a more directly understandable form 
by inserting a complete set of states | X) (X| between the two factors of He in (93) and 
removing the 7-product by explicitly taking the absorptive part. This yields 


Eg ~ (A |Her 


X)(X Heal), (94) 


where one immediately recognises the decay rate as the modulus squared of the decay 
amplitude (summed over all final states X). The reason to introduce (93) is that the T- 
product, by means of Wick’s theorem, allows for a direct evaluation in terms of Feynrnan 
diagrams. 


In order to compute Pg an operator product expansion is applied to (93), resulting in 
a series of local operators of increasing dimension. The coefficients of these operators are 
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correspondingly suppressed by increasing powers of 1/m,. The series has the form 
7 ZG y Zqi y - 
T = Ty bb + 2 bgo Gb +S) -5 brig gbt., (95) 
b b 


where we have written the first few operators of dimension three (bb), five (bgo-Gb) and 
six (briq qI';b). The matrix elements of the operators contain the soft, nonperturbative 
physics, their Wilson coefficients I, z, the hard contributions, which are calculable in 
perturbation theory. Again, the coefficients are determined by an appropriate matching 
calculation between (93) and the r.h.s. of (95). The Feynman diagrams for the three 
terms in (95) are shown in Figure 11 The two weak-interaction vertices in these diagrams 


Figure 11. Heavy quark expansion for the total decay rate of b-hadrons. 


correspond to the two factors of Her in the definition of 7 in (93) (the absorptive part. of 
the diagrams is understood). 


Obviously, the heavy quark expansion is different from HQET. However, we may 
still use HQET in conjunction with (95) in order to further analyse the hadronic matrix 
elements. An important example is the leading dimension-three operator 6b. Its matrix 
element between heavy-hadron states H can be expanded in HQET as 


(bb) =14 mi (h(iD)2h) + i (hgoGh), (96) 
where (...) = (H|...|H)/(2my). 

Equations (92), (95) and (96) imply that to leading order in the HQE Ty =Ts, that 
is the total decay rate of all b-flavoured hadrons is equal to the rate of free b-quark decay. 
Pictorially this can be seen from the first diagram in Figure 11, which represents essentially 
the amplitude squared for the partonic decay of a b-quark. Note also that perturbative 
QCD corrections to I can consistently be taken into account. The gluonic corrections 
to inclusive b-quark decay are infrared safe, as required for I’, in its role as a Wilson 
coefficient of the HQE. Also, corrections proportional to powers of as(mẹ) ^ 1/1n(m,/A) 
are only suppressed by inverse powers of In m, in the heavy-quark limit, and hence formally 
leading in comparison to higher corrections in the HQE, which are suppressed by powers 
of A/m. The calculation of heavy-quark decay in the parton picture has been used since 
the beginnings of heavy-quark physics as an approximation for inclusive decays of the 
corresponding heavy hadrons. As we have seen, the HQE gives a formal justification for 
this approach and provides us with a theoretical framework to compute nonperturbative 
corrections. 


The first correction term in (96) depends on the expectation value of the momentum 
squared (p?) of the heavy quark inside the hadron. This matrix element is non-zero 
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because the heavy quark is recoiling against the light degrees of freedom through gluonic 
interactions in the hadronic bound state. This term has a very intuitive interpretation. 
It corresponds to a correction factor 1 — (p?)/(2m2) = 1 — (v2)/2, which is just the 
reduction of the free decay rate from time dilatation due to the recoil motion of the heavy 
quark. The second correction comes from interactions of the light hadronic cloud with 
the heavy-quark spin. We have 


(mi,—-m$à) H=B 


H-AC (97) 


Q niw 


(hgoGh) = i 


The result is zero for the A, baryon since the light degrees of freedom are in a state of zero 
total angular momentum. Note that the spin interaction enters twice in (95), explicitly 
with coefficient zg and via the expansion of (bb). 


The leading nonperturbative corrections start only at second order. There is no cor- 
rection linear in 1/m». This is because there is no gauge-invariant operator of dimension 
four that could appear in the HQE. 


At order 1/mj contributions appear where the spectator quark participates directly 
in the weak interactions. For b-mesons they can be interpreted as the effect of weak 
annihilation (WA) of the b-quark with the valence d-quark (for B4) and as the effect of 
Pauli interference (PI)(for B,). The latter phenomenon occurs because in the nonleptonic 
decay of a B,, b(ü) — cüd(à), two identical ü-quarks are present in the final state. These 
corrections distinguish, in particular, between Ba and B, mesons and are responsible for 
their lifetime difference. Despite the suppression by three powers of mẹ these effects can 
be relatively important due to their two-body kinematics, which brings a phase-space 
enhancement factor of 167? in comparison to the leading three-body decay. 


As one of the possible applications, the HQE provides us with a theory of heavy- 
hadron lifetimes. The deviations of lifetime ratios from unity probes the power corrections. 
At present there are still sizeable theoretical uncertainties due to the hadronic matrix 
elements (bTq GT). They can in principle be computed with the help of lattice gauge 
theory. Table 2 shows a comparison of theoretical predictions and experimental results 
(see for instance (Ligeti 2001)). 


Table 2. | Experiment Theory 
r(B*)/r(B]) | 1.068 +0.016| 1-141 
T(B,)/T(Ba) | 0.947 + 0.038 | 0.99 — 1.01 
r(Ap)/T(Ba) | 0.795 +0.053 | 0.9 — 1.0 


5.2 Local quark-hadron duality 


A systematic uncertainty within the HQE framework, which is often debated in the lit- 
erature, arises from the issue of quark-hadron duality. In this paragraph we give a brief 
and heuristic discussion of the basic idea behind this topic. 


The theoretical prediction for an inclusive decay rate obtained from the HQE has the 
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Me 


Figure 12. ['/T) as function of my (arbitrary units). 


form 
oo A n 
T/T 21 M5 z (=>) , (98) 
n=2 my 


where we have denoted the leading, free-quark result by 9. Let us consider the decay 
rate as a function of mẹ, keeping A = Agcp constant. Then the quantity l/To, to any 
finite order in (A/m), is à simple polynomial expression in this variable. This is sketched 
as the monotonic curve in Figure 12 showing l'/T', as function of mẹ (in arbitrary units). 
Now since, by construction, the HQE for I/To yields a power expansion in (A/m), any 


term of the form 
may S (mp 
exp (- (F) LE VE (99) 


for example, present in the true result for I'/T'g would be missed by the HQE. This is 
due to the exponential suppression in the expansion parameter. In fact, the function 
exp(—1/z) is non-analytic. Its power expansion around x = 0 gives identically zero. 
However, such (or similar) terms are expected to be part of the true T/T on general 
grounds. The corresponding complete result for I'/I'o, including such a term, is sketched 
as the oscillating graph in Figure 12. This true curve represents the physical result for 
the decay rate l'/To, which consists of the inclusive sum over all the different exclusive 
decay channels. It is intuitively understandable that the true m)-dependence will have 
such a damped oscillating behaviour: if we imagine continually increasing m, l'/Tg will 
undergo a small jump whenever it reaches a value at which the presence of a further higher 
hadronic resonance in the final state becomes kinematically allowed. Since the excited 
hadrons have finite widths, the threshold behaviour will be smoothed out, resulting in the 
pattern of damped oscillations. 


The term quark-hadron duality refers to the idea that the inclusive rate as the sum 
over all exclusive hadronic decay channels and the inclusive rate as predicted by the heavy 
quark expansion are dual to each other. This means they are both valid representations 
of the same quantity using different descriptions, the hadron level or the quark level. The 
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term local refers to the fact that the energy scale my is a fixed quantity, as opposed to e.g. 
the centre-of-mass energy in e*e^ annihilation, which can be averaged to obtain suitably 
defined “global” quantities. In principle, the hadronic description gives the true result, 
measured in experiment. The problem is, however, that we would have to compute all 
exclusive rates first, which is far beyond our current control of nonperturbative QCD. On 
the other hand, the HQE calculation can be performed, within some uncertainties, but 
it is clear that the result need not be identical to the true answer. A deviation between 
the latter and the HQE (including power corrections) is referred to as a violation of 
quark-hadron duality. Indeed, contributions violating quark-hadron duality are expected 
(see(99)), but the numerical size of these terms cannot be strictly computed at present. 
Conceptually this is no problem because they are formally subleading in comparison to 
power corrections, so that the HQE still makes sense even at higher orders. The remaining 
question is how large can violations of quark-hadron duality be numerically. While there 
are at the moment, within the uncertainties intrinsic to HQE, no established cases in 
inclusive B decays where duality is violated, the issue clearly needs further investigation, 
both theoretically and phenomenologically. 


A more detailed account of the status of quark-hadron duality can be found in the 
papers by Blok, Shifman & Zhang (1998), Shifman (2000) and Bigi & Uraltsev (2001). 


5.3 Inclusive semileptonic decays: Vus and Ve 


The HQE cannot only be applied to the total decay rates, but also to inclusive rates 
with specific flavour quantum numbers in the final state, such as semileptonic processes. 
Furthermore one can analyse differential decay rates. 


An example of special interest is the inclusive decay B — X,li, which can be used to 
extract V4. The HQE for the integrated rate has the form 


Gimj 2 Àl T 3A» 6A» 
X, = c 1 : +., 100 
T(B > Xav) 19253 Veal” | 23 + 2m + 25 mi (100) 
with 123 3 
(bb) 214 21022 (bo Gb) = 6X, = = (m3, — m3). (101) 
2m; 2 
The Wilson coefficients read 
z3 = 1- 8r 82? — a* — 122? lInz+O(a,), (102) 
z = -—(1- x), (103) 


where z = (m,/m»)’. 

A major source of theoretical uncertainty for the determination of |V4| using (100) 
is the b-quark mass. This appears to be especially problematic since mp comes with the 
fifth power in (100). Fortunately, however, the actual situation is not as bad. Taking into 
account the phase-space function 25, one finds that the combined dependence on m, and 
m, shows the approximate behaviour 


T(B > Xav) ~ m2? (m, — m)?". (104) 
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Since the difference my — m, is better known than the individual quark masses, the 
corresponding uncertainty is reduced. The quark-mass difference is in fact constrained by 
HQET, which gives (72) 


Ma — Me 


| 
EJ 
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| 
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> 
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H 


3.40 + 0.03 + 0.03GeV, (105) 


where mg = (mg + 3mg«)/4. 
The QCD corrections to z3 are known to O(o;) and partly at O(o2). The special class 


of corrections O(8 !a?) has been calculated to all orders n. 


Numerically the inclusive method gives (Hócker et al. 2001) 
Vip = 0.04076 + 0.00050exp + 0.002044, (106) 


which can be compared with the result from the exclusive determination via B — D*li 
(90). 

One can also try to extract |V,4| from B — X,lv decays. This is more difficult since 
the very large background from semileptonic b — c transitions requires kinematical cuts 
(in the lepton energy, the hadronic or the dilepton invariant mass), which renders the 
HQE less reliable and introduces larger uncertainties. A recent discussion has been given 
by Bauer et al. (2001b). The HQE has further useful applications, for instance in the 
case of the inclusive rare decays B > X,ay, Bo X,4U U, or B > X, avi. 


Exercise 


Show that quark-hadron duality is exactly fulfilled for the semileptonic b — c transition 
rate in the Shifman- Voloshin (small-velocity, or SV) limit my, m, > my — m, > Agen. 
This holds with only two exclusive channels on the hadronic side of the duality relation, 
that is the inclusive rate is saturated as T(B > X,lv) 8 T(B > Div) - T(B > D*lv) in 
this limit. 

Solution: We start from the exclusive differential decay rates in the heavy-quark limit. 
They read (see e.g. Harrison & Quinn 1998): 


d'(B— Dl) — Gi|Vaf 


UM) a TETA (mg + mp)mi(u? - 1^ (v) (107) 
dT(B > D*lv) G2|V.? 
du = ign (mg — mp) mi. Vw? — Mw + 1)? 
4w mz —2wmgm y. + mM,» 
x [1 P^. | £(w). 108 
( + wal (mp — mp)? E(w) ( ) 
In the strict SV limit we have 

MB = Mb, Mp* = mp = Me, m, = m(1 — €), (109) 


where e = (m, — m_)/m, is a small parameter. The variable w is related to the dilepton 
invariant mass q? through 


q? =m, +m - 2mgmpuw. (110) 
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The kinematic limits of g? are easily identified as 


Qa = (me — Me)”, Grin = 0. (111) 
The corresponding limits of w are 
mg +m? 
Wmin = 1, Wmax = A. (112) 
Thy. 


Defining s = w — 1 we have 0 € s < «?/2, where the upper limit is valid to leading order 
in e. Expanded to leading order in e, (107) gives 
G2 IVa? e?/2 G2 [Va]? 

T(B > Div) = es - mia f 8? ds = Soos - (me — my, (113) 
which is the decay rate in the SV limit. In this derivation we have made use of the fact 
that E(w) = €(1) + O(e?), which can be approximated by £(1) = 1. In this way any 
dependence on nontrivial hadronic input has disappeared. Similarly we can expand the 
integral over (108) in € to extract the leading contribution in the SV limit. We obtain 


Gel Val? 


T(B > D*lv) = 
(B > Dil) = o 


(Me — me}. (114) 
We also observe that higher D-meson resonances and hadronic multiparticle states have 
wave functions of the light degrees of freedom that are orthogonal to the ground state 
wave function of the light cloud (identical for D, D* and B) in the SV limit. There is 
therefore no overlap of those higher excitations with the initial B and the corresponding 
rates vanish. 


Finally, we need to take the SV limit of the inclusive rate as obtained from the heavy 
quark expansion in (100). In this limit the second-order power corrections and perturba- 
tive QCD corrections disappear, and we only have to expand the phase space function z3 
in the small-e limit. We find z3 = 646/5 + O(c$) and 

QI ]2 
T(B > Xv) = CI aL (qm, — my. (115) 
We see that indeed the inclusive HQE result (115) is saturated by the sum of just the 
two exclusive rates (113) and (114). Clearly, the SV limit is à very special situation. 
Nevertheless, it is an interesting example of exact (local) quark-hadron duality. Moreover, 
the semileptonic rates into D and D* measured in experiment account for roughly two 
thirds of the inclusive rate, indicating that the SV limit is not even entirely unrealistic. 


6 QCD factorisation in exclusive hadronic B decays 


6.1 Introduction 


Decay amplitudes for exclusive nonleptonic B decays, such as B > 77, can be computed 
starting from the effective weak Hamiltonian discussed in Section 3.3. Whereas the Wilson 
coefficients C; are well understood, the main problem is posed by the hadronic matrix 
elements of the operators Q;. In some cases this problem can be circumvented (CP 
asymmetry in B > J/V Ks), or at least reduced using SU(2) or SU(3) flavour symmetries 
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and an appropriate combination of various channels. However, an improved understanding 
of the QCD dynamics in exclusive hadronic B decays would greatly enhance our capability 
to extract from these processes the underlying flavour physics. 


Indeed, it turns out that the heavy-quark limit leads to substantial simplifications also 
in the problem of hadronic two-body decays of heavy hadrons. Again the main feature 
is the factorisation of short-distance and long-distance contributions. In the case of the 
matrix elements of four-quark operators Q; the factorisation takes the form 


(OQIB) = e) [ du Tiuta) 
+f d£dudv T?! (€, u,v) ®p(€)®,(u)®,(v). (116) 


This factorisation formula is valid up to corrections of relative order Agcp/ms. Here 
f8>"(q?) is a B — m form factor evaluated at q? = m2 = 0, and ©, ($5) are leading- 
twist light-cone distribution amplitudes “wave functions”) of the pion (B meson). These 
objects contain the long-distance dynamics. The short-distance physics, dominated by 
scales of order mẹ, is described by the hard-scattering nD (ee TL ?!. which are calculable 
in perturbation theory. 77 starts at O(o9), T/! at O(a!) (see Figure 13). In (116) long- 


AL of N\A 
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Figure 13. Order a, corrections to the hard scattering kernels T]. (first two rows) and 
Ti! (last row). In the case of T], the spectator quark does not participate in the hard 
interaction and is not drawn. The two lines directed upwards represent the two quarks 
forming the emitted pion. 


and short-distance contributions are thus systematically disentangled, that is factorised. 
The long-distance sensitive quantities (form factors and wave functions) still need to be 
determined by other means, but they are universal quantities and much simpler than the 
original full B — az matrix elements we started with. They could in principle be cal- 
culated by nonperturbative methods or extracted experimentally from other observables. 
In any case (116) represents a substantial simplification of our problem. 


The general expression (116) further simplifies when we neglect perturbative œ,- 
corrections. The T! term is then absent and the kernel T becomes a constant in u, 
such that the pion distribution amplitude integrates to the pion decay constant. The 
matrix element of operator Qt, for instance, reduces to 


(ntr |(ub)y—a(du)y—a|B) > (27 |(ab)y—4|B)- (7 |(du)v-4|0) = im fP7"(0)f.. (117) 


This procedure, termed “naive factorisation” has long been used in phenomenological 
application, but the justification had been less clear. An obvious issue is the scheme 
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and scale dependence of the matrix elements of four-quark operators, which is needed to 
cancel the corresponding dependence in the Wilson coefficients. This dependence is lost 
in naive factorisation as the factorised currents are scheme independent objects. In QCD 
factorisation (116) the proper scale and scheme dependence is recovered by the inclusion 
of O(as) corrections as we will see explicitly below. 


A qualitative justification for (117) had been given by Bjorken (1989). It is based on 
the colour transparency of the hadronic environment for the highly energetic pion emitted 
in B decay (the 7~ in the above example, which is being created from the vacuum). This 
is related to the decoupling of soft gluons from the small-size colour-singlet object that 
the emitted pion represents. The QCD factorisation approach as encoded in (116) may be 
viewed as a consistent formalisation and generalisation of Bjorken's colour transparency 
argument. This treatment of hadronic B decays is based on the analysis of Feynman dia- 
grams in the heavy quark limit, utilising consistent power counting to identify the leading 
contributions. The framework is very similar in spirit to more conventional applications 
of perturbative QCD in exclusive hadronic processes with a large momentum transfer, as 
the pion electromagnetic form factor (see the article by Sterman and Stoler (1997) for a 
recent review). It justifies and extends the ansatz of naive factorisation. In particular the 
method includes, for B — 77, the hard nonfactorisable spectator interactions, penguin 
contributions and re-scattering effects (Figure 13). As a corollary, one finds that strong 
re-scattering phases are either of O(a,), and calculable, or power suppressed. In any case 
they vanish therefore in the heavy quark limit. QCD factorisation is valid for cases where 
the emitted particle (the meson created from the vacuum in the weak process, as opposed 
to the one that absorbs the b-quark spectator) is a small size colour-singlet object, e.g. 
either a fast light meson (7, o, K, K*) or a J/V. 


Note that the term factorisation is used here for two a priori entirely different things. 
In the case of QCD factorisation (116), it refers to the factorisation of short-distance 
and long-distance contributions. In the sense of the phenomenological approach of naive 
factorisation (117), it simply denotes the separation of the hadronic matrix element of 
a four-quark operator into two factors of matrix elements of bilinear currents. [t is a 
nontrivial result that the latter, naive factorisation is obtained as the lowest order ap- 
proximation of QCD factorisation. To avoid confusion, it is useful to keep the distinction 
in mind. For example, the hard gluon exchange corrections between the two quark cur- 
rents in Figure 13 are "nonfactorisable" in the sense of naive factorisation, although they 
are a consistent ingredient of (116), hence "factorisable" in the sense of QCD. 


In the following we shall discuss QCD factorisation in some detail using the example 
of B — Dr decays. In this case the b > u transition current is replaced by a heavy-heavy 
b — ccurrent. This case is somewhat simpler than B — 7 since the spectator interaction 
(the T"! term, bottom line of Figure 13) does not contribute to leading power. This is 
because for a heavy-to-heavy transition the spectator quark, and hence the gluon attached 
to it, is always soft. This leads to a suppression, according to the colour transparency 
argument, when this gluon couples to the emitted pion. Also penguin contributions are 
absent for B + Dr. We shall illustrate explicitly how factorisation emerges at the one- 
loop order in this specific case, and in the heavy-quark limit, defined as my, mc >> Agcp 
with m/m, fixed. 


Further details on QCD factorisation in B decays and additional literature can be 
found in the articles by Beneke et al. (1999, 2000 & 2001). 
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6.2 B — Dr: factorisation to one-loop order 
6.2.1 Preliminaries 


The effective Hamiltonian relevant for B — D can be written as 


Her = aN, VA (CyOo + CgQs) , (118) 

with the operators 
Op = &*(0 — 3s)b do, (1 — s)u, (119) 
Os = GyY'(1— y)T"bdvy,(1— ys)T"u. (120) 


Here we have chosen to write the two independent operators in the singlet-octet basis, 
which is most convenient for our purposes, rather than in the more conventional bases of 
Qi, Q2 or Q4, Q- (see the discussion in Section 3; because all four quark flavours are 
different in (118), penguin operators are absent). The Wilson coefficients Cy, Cg have 
been calculated at next-to-leading order in renormalisation-group improved perturbation 
theory (Altarelli et al. 1981, Buras & Weisz 1990) and are given by 


= eio, N 1 € —- C. -C_, (121) 
where 
C, (u) = ( + en.) Cu), (122) 
d4 
Ci) = e] h + SO) at g, 1.) | (123) 


(The coefficients Co, Cs are related to the ones of the standard basis by Cy = C, + C/3 
and Cs = 2C5.) We employ the next-to-leading order expression for the running coupling, 


4n 6, In a fakon], 
s = 124 
es) fo In(u 2/ Aco) | pê In( 2/ Aco) (124) 
|. MN, — 2f 34,5. 10 | N2-1 z 
Bo = 3 , Bi 3 N; 3 Nef 2Crf, Cr = 2N, , (125) 


where N, is the number of colours, and f the number of light flavours. Agcp = AUD is 
the QCD scale in the MS scheme with f flavours. Next we have 


(0) 
Y+ Ne F1 Ne F1 

d, = 4, = +12 — B, = t——.— B. 12 

* 2 oN,’ * 2N, (126) 


The general definition of J+ may be found in (Buchalla, Buras & Lautenbacher 1996). 
Numerically, for Ne = 3 and f 25 


$ 6473 
d+ = | 12 B+- J= | _ 9371 (127) 
23° 1587" 
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(a) (b) (c) 


Figure 14. Basic quark-level topologies for B —^ Dx decays (q = u, d): (a) class-I, (b) class- 
II, (c) weak annihilation. Bg — D*n- receives contributions from (a) and (c), Ba — Ds? 
from (b) and (c), and B- + DÜn- from (a) and (b). Only (a) contributes in the heavy-quark 
limit. 


The quantities 8), 81, d+, B.—J are scheme independent. The scheme dependence of the 
coefficients at next-to-leading order is parametrised by B, in (122). In the naive dimen- 
sional regularisation (NDR) and ‘t Hooft-Veltman (HV) schemes, this scheme dependence 
is expressed in a single number B with Bxpg = 11 and Byy = 7. The dependence of 
the Wilson coefficients on the renormalisation scheme and scale is cancelled by a corre- 
sponding scale and scheme dependence of the hadronic matrix elements of the operators 
Oy and Os. 

Before continuing with a discussion of these matrix elements, it is useful to consider 
the flavour structure for the various contributions to B — Ds decays. The possible 
quark-level topologies are depicted in Figure 14. In the terminology generally adopted 
for two-body non-leptonic decays, the decays By > Dtr, By  D?sz? and B- > D?s- 
are referred to as class-I, class-II and class-III, respectively. In both By > Dtr- and 
B- — D°n~ decays the pion can be directly created from the weak current. We may 
call this a class-I contribution, following the above terminology. In addition, in the case 
of B, > Dtr” there is a contribution from weak annihilation and a class-II amplitude 
contributes to B^ — D°r~, see Figure 14. The important point is that the spectator 
quark goes into the light meson in the case of the class-II amplitude. This amplitude is 
suppressed in the heavy-quark limit, as is the annihilation amplitude. The amplitude for 
B, > D?s, receiving only class-II and annihilation contributions, is therefore subleading 
compared with By > D*z- and B- — D*s-, which are dominated by the class-I 
topology. The treatment of this leading class-I mechanism will be the main subject of 
the following sections. We shall use the one-loop analysis for Bg > D*-- as a concrete 
example on which we will illustrate explicitly how the factorisation formula can be derived. 


6.2.2 Soft and collinear cancellations at one-loop 


In order to demonstrate the property of factorisation for Ba — Dtr”, we have to anal- 
yse the “non-factorisable” one-gluon exchange contributions (Figure 15) to the b > cid 
transition. We consider the leading, valence Fock state of the emitted pion. This is justi- 
fied since higher Fock components only give power-suppressed contributions to the decay 
amplitude in the heavy-quark limit. The valence Fock state of the pion can be written as 


du al, 1 


Ira) = ET JEN (ah (04)}f (lg) — af (lg)oK(Ug)) 10) B(us1i), — (128) 
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(a) (b) (c) (d) 


Figure 15. "Non-factorisable" vertez corrections. 


where at (bi) denotes the creation operator for a quark (antiquark) in a state with spin 
s =t or s =|, and we have suppressed colour indices. This representation of the pion 
state is adequate for a leading-power analysis. The wave function WV(u,l,) is defined 
as the amplitude for the pion to be composed of two on-shell quarks, characterised by 
longitudinal momentum fraction u and transverse momentum /,. The on-shell momenta 
(a = 0) of the quark (/,) and the antiquark (/;) are given by 


1? ; h 
l= ug tL gno l= ugl + Fen. (129) 


In these expressions q = E(1,0,0,1) is the pion momentum, E = pg - g/mg the pion 
energy and n- = (1,0,0, —1). Furthermore l-g = lien- = 0. For the purpose of power 
counting |, ^ Agcp « E ~ m. Note that the invariant mass of the valence state is 
(la +13)? = 1 /(uü), which is of order A5cr and hence negligible in the heavy-quark limit, 
unless u is in the vicinity of the endpoints (0 or 1). In this case the invariant mass of 
the quark-antiquark pair becomes large and the valence Fock state is no longer a valid 
representation of the pion. However, in the heavy-quark limit the dominant contributions 
to the decay amplitude come from configurations where both partons are hard (u and 
ü both of order 1) and the two-particle Fock state yields a consistent description. The 
suppression of the soft regions (u or i « 1) is related to the endpoint behaviour of the 
pion wave function. We will provide an explicit consistency check of this important feature 
later on. 


As a next step we write down the amplitude 


(x(a)lulO)adta)ai0) = f durs e 0L) eulos expC ily}, — 30 


which appears as an ingredient of the B — Ds matrix element. The right-hand side 
of (130) follows directly from (128). Using (130) it is straightforward to write down the 
one-gluon exchange contribution to the B — Ds matrix element of the operator Og 
(Figure 15). We have 


(D* 1^ |Os| Ba) 1 giuon = (131) 


. Cr f dk - -.1 p, dH, V*(uli) 
2yr + ; i 
idi | y P Abl Bag n Sa tally la I) 
where 
À à 
YWfe— HEM) Vp t Em) 
Ai(k) = 
i(k) 2p-k — k? Ipp ktk C (132) 


Ca +E yaa + hr 
2lyk + K? 2l-k--k? 


(133) 
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Here I = 4^"(1— 5) and pp, p, are the momenta of the b quark and the c quark, respectively. 
Note that this expression holds in an arbitrary covariant gauge. The gauge-parameter 
dependent part of the gluon propagator gives no contribution to (131), as can be seen 
from (132) and (133). There is no correction to the matrix element of Op at order as, 
because in this case the (dà) pair is necessarily in a colour-octet configuration and cannot 
form a pion. 


In (131) the pion wave function V(u,1,) appears separated from the B — D transition. 
This is merely a reflection of the fact that we have represented the pion state by (128). 
It does not, by itself, imply factorisation, since the right-hand side of (131) involves still 
nontrivial integrations over 1, and gluon momentum k, and long- and short-distance 
contributions are not yet disentangled. In order for (131) to make sense we need to 
show that the integral over k receives only subdominant contributions from the region of 
small k?. This is equivalent to showing that the integral over k does not contain infrared 
divergences at leading power in 1/m,. 


To demonstrate infrared finiteness of the one-loop integral 
1 
J= | d'k Ak) 8 Ass k) (134) 


at leading power, the heavy-quark limit and the corresponding large light-cone momentum 
of the pion are again essential. First note that when k is of order mẹ, J ~ 1 for dimensional 
reasons. Potential infrared divergences could arise when k is soft or when k is collinear 
to the pion momentum q. We need to show that the contributions from these regions are 
power suppressed in mọ. (Note that we do not need to show that J is infrared finite. It 
is enough that logarithmic divergences have coefficients that are power suppressed.) 


We treat the soft region of integration first. Here all components of k become small 
simultaneously, which we describe by scaling k ~ A. Counting powers of A (dik ~ M, 
1/k? ~ A7?, 1fp - k ~ A`!) reveals that each of the four diagrams (corresponding to 
the four terms in the product in (134)) is logarithmically divergent. However, because 
k is small, the integrand can be simplified. For instance, the second term in A, can be 
approximated as 


2 


mene nilan) HEE a > 


- "ES 135) 
21, - ka KP E ROC ( 
ett Qug-k+2l,-k+{—L\n kth 4 

2uE 


where we used the fact that g to the extreme left or right of an expression gives zero due 
to the on-shell condition for the external quark lines. We get exactly the same expres- 
sion but with an opposite sign from the other term in A? and hence the soft divergence 
cancels out when adding the two terms in As. More precisely, we find that the integral 
is infrared finite in the soft region when l, is neglected. When /, is not neglected, there 
is a divergence from soft k which is proportional to  /m; ~ Agcp/m. In either case, 
the soft contribution to J is of order Agcp/ms or smaller and hence suppressed relative 
to the hard contribution. This corresponds to the standard soft cancellation mechanism, 
which is a technical manifestation of colour transparency. 


Each of the four terms in (134) is also divergent when k becomes collinear with the 
light-cone momentum q. It is convenient to introduce, for any four-vector v, the light-cone 
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components 
v? + v? 
UE = V2 (136) 
The collinear region is then described by the scaling 
ke, kA koX. (137) 


Then dik ~ dk*dk-d?k, ~ M and q- k = q*^k- ~ X, k? = 2k*k^ + k? ~ M. The 
divergence is again logarithmic and it is thus sufficient to consider the leading behaviour 
in the collinear limit. Writing k = aq +... we can now simplify the second term of A» as 


ye E)P |. 2(u t a)r 
QW, ka kh PD kF k? 


(138) 


No simplification occurs in the denominator (in particular |, cannot be neglected), but the 
important point is that the leading-power contribution is proportional to qa. Therefore, 
substituting k = aq into A, and using q? = 0, we obtain 


f(f.- mV — P(fo + mod 
2ape: q 2app -q 


QÁ ~ =0, (139) 
employing the equations of motion for the heavy quarks. Hence the collinearly divergent 
region is seen to cancel out via the standard collinear Ward identity. This completes 
the proof of the absence of infrared divergences at leading power in the hard-scattering 
kernel for By — Dtr” to one-loop order. In other words, we have shown that the ^non- 
factorisable” diagrams of Figure 15 are dominated by hard gluon exchange. The proof at 
two loops has been given by Beneke et al. (2000) and a proof to all orders by Bauer et 
al. (2001c). 
Since we have now established that the leading contribution to J arises from k of order 
m, (“hard” k), and since |l;| « E, we may expand Ag in |l; |/ E. To leading power the 
expansion simply reduces to neglecting l} altogether, which implies lą = ug and I; = tug 
n (133). As a consequence, we may perform the lı integration in (131) over the pion 
wave function. Defining 
d, W*(uli) if, 
16:5? /2N, AN, 


the matrix element of Og in (131) becomes 


li 


$, (u), (140) 


(Dr Isl Ban gluon = (141) 


d^ d 
— NL 8N, | em k)bIBa) s TUN du 9, ( u)tr [ys Aol uq, üq, k)]. 


Putting y on the light-cone in (130), y* = y1 = 0, hence l4: y = lzy ^ = uqy, we see 
that the /;-integrated wave function $,(u) in (140) is precisely the light-cone distribu- 
tion amplitude of the pion in the usual definition. Indeed, the leading-twist light-cone 
distribution amplitude for pseudoscalar mesons (P) reads 


(POI) sl). yoy = es). [duc m Bplay). 042 


(z-y)?-0 
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Here it is understood that the operator on the left-hand side is a colour singlet. We 
use the “bar”-notation, i.e. 0 = 1 — v for any longitudinal momentum fraction variable. 
The parameter p is the renormalisation scale of the light-cone operators on the left-hand 
side. The distribution amplitude is normalised as f] du®p(u,u) = 1. One defines the 
asymptotic distribution amplitude as the limit in which the renormalisation scale is sent. 
to infinity. The asymptotic form is 


$p(u, u) "© bua. (143) 


The decay constant appearing in (142) refers to the normalisation in which f; = 131 MeV. 
There is a path-ordered exponential that connects the two quark fields at different posi- 
tions and makes the light-cone operators gauge invariant. In (142) we have suppressed 
this standard factor. 


This derivation demonstrates the relevance of the light-cone wave function for the 
factorisation formula. Note that the collinear approximation for the quark-antiquark 
momenta emerges automatically in the heavy-quark limit. 


After the k integral is performed, the expression (141) can be cast into the form 
- 1 
(D+ |Os|Ba)1-guon ~ Fp-sp(0) [ du Ts(u, 2)®,(u), (144) 


where z = m/m», Ta(u, z) is the hard-scattering kernel, and Fpg.,p(0) the form factor 
that parametrises the (D+|č[. ..]b| B3) matrix element. The result for Tg(u, z) will be given 
in the following section. 


6.2.3 Matrix elements at next-to-leading order 


As we have seen above, the By > D*z- amplitude factorises in the heavy-quark limit 
into a matrix element of the form (D*|é{. . .]b| Ba) for the B — D transition and a matrix 
element (z^ |d(z)[. ..]u(0)|0) with x? = 0 that gives rise to the pion light-cone distribution 
amplitude. Leaving aside power-suppressed contributions, the essential requirement for 
this conclusion was the absence of both soft and collinear infrared divergences in the gluon 
exchange between the (cb) and (du) currents. This gluon exchange is therefore calculable 
in QCD perturbation theory. We now present these corrections explicitly to order o. 


The effective Hamiltonian (118) can be written as 


_ Gp v Netla N.—14 os(u) Cr 
Her = Z Vaa Veb | Res + ON, C. (u) + dn 2N, BCs(u) | Oo 
+ [Cs] Os}, (145) 


where the scheme-dependent terms in the coefficient of the operator Oo, proportional to 
the constant B defined after (127), have been written explicitly. 


Schematically, the matrix elements of both Oy and Og can be expressed in a form 
analogous to (144). Because the light-quark pair has to be in a colour singlet to produce 
the pion in the leading Fock state, only Oy gives a contribution to zeroth order in ag. 
Similarly, to first order in o; only Os can contribute. The result of computing the diagrams 
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in Figure 15 with an insertion of Og can be written in a form that holds simultaneously 
for H = D,D* and L = s, p, using only that the (ud) pair is a colour singlet and that 
the external quarks can be taken on-shell. One obtains (z = m./m;) 


(HOLOE) = SSE us f’ dutu) (146) 


x |- ( In +B) ((Jy)—(J4)) + F(u, z) (Jv) - F(u, —2) Ja) | , 


where 
(Jv) = (H(p')lēg b|Ba(p)), — (Ja) = (H(p') ets) |Ba(p))- (147) 

In obtaining (146) we have used the equations of motion for the quarks to reduce the 
operator basis to Jy and J4. The form of (146) is identical for pions and longitudinally 
polarised p mesons. The production of transversely polarised p mesons is power suppressed 
in Agcp/™p. 

In the case of a distribution amplitude ®; (u) that is symmetric under u à, which 
is relevant for L = 7, p, the function F (u, z) appearing in (146) can be compactly written 
as 


F(u,z) 2 31nz? — 7+ f(u,z) + f(u, 1/2), (148) 
with 
| w(1 — z?)[3 — u(1— 2?)) + z] nlu 2 z 
f(u, z) m i _ u(1 ;2)? l [ (1 )] 1- u(1 m zy (149) 


The contribution of f(u,z) in (148) comes from the first two diagrams in Figure 15 with 
the gluon coupling to the b quark, whereas f(u,1/z) arises from the last two diagrams 
with the gluon coupling to the charm quark. The absorptive part of the amplitude, which 
is responsible for the occurrence of strong rescattering phases, arises from f(u,1/z) and 
can be obtained by recalling that z? is z? — ie with e > 0 infinitesimal. We then have 


(150) 


Z Im F(u, 2) — (1 u)(1 27)[3(1 zm 2)) +2) 


[I — u(1 — 22)? 


As mentioned above, (146) is applicable to all decays of the type By — Doer, 
where L is a light hadron such as a pion or a (longitudinally polarised) p meson. Only the 
operator Jy contributes to By + D* L-, and only J, contributes to By — D** L-. (Due 
to helicity conservation the vector current B — D* matrix element contributes only in 
conjunction with a transversely polarised p meson and hence is power suppressed in the 


heavy-quark limit.) Our final result can therefore be written as 
— - l 
(D* L7 |Oss| By) = (D*ley"(1 — s)b| Ba) ifi] du Tasglu, 2) 6, (u), (151) 


where L — 7, p, and the hard-scattering kernels are 


To(u,z) = l-4O(o2), (152) 
as Cr pe 2 
Ta(u, z) dx 2N, | 61n m B+ F2) + O(a). (153) 
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When the D meson is replaced by a D* meson, the result is identical except that F (u, z) 
in (153) must be replaced by F(u, —z). Since no order a, corrections exist for Oo, the 
matrix element retains its leading-order factorised form 


(D* L |Oo| Ba) = ifiqu (Dte (1 — *s)5| B4) (154) 


to this accuracy. From (149) it follows that T&(u, z) tends to a constant as u approaches 
the endpoints (u — 0, 1). Therefore the contribution to (151) from the endpoint region 
is suppressed, both by phase space and by the endpoint suppression intrinsic to P, (u), 
which can be represented as 


$;(u) = 6u(1— u) re Xt ) C3? (2u — 1)| , (155) 


vanishing ~ u (~ (1 — u)) at the endpoints. Here C?! (x t= 3x, CR? (x)= 2 (52?—1), etc. 
are Gegenbauer polynomials. The Gegenbauer moments oL (u) are nonperturbative quan- 
tities. They are multiplicatively renormalised and approach zero as u — oo. (The scale 
dependence of these quantities enters the results for the coefficients only at order o, 
which is beyond the accuracy of a NLO calculation.) 


As a consequence of the endpoint suppression the emitted light meson is indeed dom- 
inated by energetic constituents, as required for the self-consistency of the factorisation 
formula (151). 

Combining (145), (151), (152) and (153), we obtain our final result for the class-I, 
non-leptonic By > pe)*r- decay amplitudes in the heavy-quark limit, and at next- 


to-leading order ir. œs. The results can be compactly expressed in terms of the matrix 
elements of a "transition operator" 


G 
= V aa cb [a (DL) Qv - a (D*L) )Q4]. (156) 
where - - 
Qv = cy'b @ dy,(1—55)u, Qa = ey"ysb & dy,(1— ys)u, (157) 


and hadronic matrix elements of Qy,4 are understood to be evaluated in factorised form, 
i.e. 
(DL|Ài ® j2|B) = (D|Iji| B) (£5210). (158) 


Equation (156) defines the quantities a; (D*)L), including the leading “non-factorisable” 
corrections, in a renormalisation-scale and -scheme independent way. To leading power in 
Aacp/ms these quantities should not be interpreted as phenomenological parameters (as 
is usually done), because they are dominated by hard gluon exchange and thus calculable 
in QCD. At next-to-leading order we get 


N.-1 wo = 
a(DL) = ON, C, (u) + lo (u) 
Os Cr o 
Tr 2N, © me a du F(u, z) situ), (159) 
ey Nl Ne-la , 
a(D'L) = ON, C4 (u) + ON, le (u) 


Qs Cr "n 1 
— $ . 
um 2N, e) | 6In m? «f du F(u, —z) sto) (160) 
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These expressions generalise the well-known leading-order formula 


(u) + (n). (161) 


N.-1 
LO c LO 
C 2N: 


a, = 2N, + 
We observe that the scheme dependence, parametrised by B, is cancelled between the co- 
efficient of Oo in (145) and the matrix element of Os in (151). Likewise, the u dependence 
of the terms in brackets in (159) and (160) cancels against the scale dependence of the 
coefficients C, (ui), ensuring a consistent physical result at next-to-leading order in QCD. 
The coefficients a,(DL) and a; (D* L) are seen to be non-universal, i.e. they are ex- 
plicitly dependent on the nature of the final-state mesons. This dependence enters via the 
light-cone distribution amplitude 9; (u) of the light emission meson and via the analytic 
form of the hard-scattering kernel (F(u, z) vs. F(u, —2)). However, the non-universality 
enters only at next-to-leading order. 


Exercise 


Verify that the coefficients a, (D) L) in (159) and (160) are independent of the unphysical 
renormalisation scale u up to terms of O(o2). 


6.2.4 Phenomenological applications for B — Dr 


An important test of QCD factorisation can be performed by comparing the hadronic 
decays By > DC)* L- with the semileptonic processes By — D*)+I-v. It is useful to 
introduce the ratios 


= 6r? Vaf la (DLP X?, ^ 6) 


=m 


where X, = X; — ] for a vector meson (because the production of the lepton pair via 
a V — A current in semi-leptonic decays is kinematically equivalent to that of a vector 
meson with momentum q), whereas X, and X7 deviate from 1 only by (calculable) terms 
of order m2/m?,, which numerically are below 1%. The main virtue of (162) is that 
the B — D(*) form factors cancel in the ratio. The theoretical prediction for the QCD 
coefficients, based on QCD factorisation to leading power and at NLO in perturbative 
QCD, is la (D) L)| — 1.05. The uncertainty of this leading-power result is small, about 
+0.01. The prediction is to be compared with the experimental results, extracted from 
(162), which read |a;(D*)| = 1.08 + 0.07, |a, (D*p)| = 1.09 + 0.10 and |a;(D*a;)| = 
1.08 +0.11. These values show fair agreement with the theoretical number, albeit within 
experimental uncertainties that are still large. 

Another interesting consideration concerns the comparison of class-I modes with those 
of class H and III. For B > D*)s, all three decay modes, which are related by isospin, 
have been measured. A nice discussion of the present experimental status and its in- 
terpretation in the context of QCD factorisation has been given by Neubert & Petrov 
(2001). Let us briefly discuss here a few important aspects. The experimental status is 
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Bo Dn | Bo D*r 
Bo > Dtr | 30404 [2364021 
B° + D*)9n° | 0.27 £ 0.05 | 0.17 + 0.05 
B 2 DOr | 53405 | 46x04 
| IC — AVIT + Al | 0.42 0.05 | 0.35 + 0.05 
brc (56 + 20)° | (51 + 20)° 
Table 3. Experimental data for B > Ds branching ratios (in units of 107?) and 
extracted parameters |C — A|/|T + Al, órc (see Neubert & Petrov (2001)). 


summarised in Table 3. Denoting the basic topologies from Figures 14(a,b,c) by T, C 
and A, respectively (where the notation refers to “tree”, “colour-suppressed tree" and 
*annihilation"), we have 


A(B,— D'a) = T+A, (163) 
V2. A(By > Ds?) = C— A, (164) 
A(B™ > D?x ^) T4 C. (165) 


For later use we may further define the spectator-interaction contribution to T, Tspec (see 
bottom row of Figure 13). A similar decomposition holds for the B — D*a modes. Isospin 
symmetry is reflected in the amplitude relation A( Ba > D*z-) + V2A(B4 > D?x?) = 
A(B- — D9z-), which is manifest in the parametrisation of (163) - (165). This means 
that there are only two independent amplitudes, which we can take to be (T' 4- A) and 
(C — A). These amplitudes are complex due to strong phases from final-state interactions. 
Only the relative phase of the two independent amplitudes is an observable and we define 
órc to be the relative phase of (T + A) and (C — A). The phase can then be extracted 
from the data through the relation 


(253) B(B-—D?x-) - B(B") S D*z-) - 2B(B95 D9s?) 


cos orc = ( 166) 


2/2. B( B°>Dtr-) B(B°+D°r°) 


In the usual heavy-quark limit, my ~ m, 2» Agcp, only T is calculable. Tspec, C and 
A are not, but they are power suppressed. It is instructive to consider the alternative 
limit my 2» m, 3» Agcp, which allows us to distinguish between m, and m, (Beneke et 
al. 2000). In this case, due to m, « mp», also the D becomes a "light" meson and in 
that respect the process is similar to B — nn, where both Typec and C are calculable. 
However, since m, >> Agcp = A, the D-meson wave function ®p(u) is highly asymmetric 
and strongly peaks at @ = (1 — u) ~ A/m,, where u is the c-quark momentum fraction. 
These properties can be used to derive the scaling rules 


Tope A C A 


Toms T (167) 
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The amplitude A is still not calculable in this scheme, while T, Tspec and C are. Note 
that from (167) we can recover the two standard scenarios we have been discussing: In 
the heavy-quark limit m, ~ my (167) reduces to a simple power suppression ^ A/m for 
A, Tij« and C compared to T. On the other hand, if m, becomes a truly light quark, 
corresponding to the case of B — s, we count m, ~ A and see that both Type, and C 
are of the same order as T, while A is still power suppressed. 


The general scenario my > m. > Aocp allows us to interpret the experimental 
results in Table 3. We can even make numerical estimates for T, Typec and C based on 
the factorisation formula for light-light final states (116). These are somewhat model 
dependent because ®p is not known at present. It is not too difficult to accommodate 
substantial values |C — A|/|T + A| ~ 0.2 - 0.3 and órc ~ 40°, in qualitative agreement 
with Table 3. Given the special role of the charm quark (not light, but also not too 
heavy), the current experimental situation is not in contradiction with QCD factorisation 
in the large-m, limit. For a comparison with experiment it is useful to keep in mind that, 
according to (167), the suppression of C over T is only ~ A/m, (not A/m,) and that órc 
is naturally expected to be of order one. 


6.3 CP violation in B — ata decay 


Hadronic B decays into a pair of light mesons, such as B — 7K or B — 7, have a very 
rich phenomenology. Their main interest lies in their sensitivity to short-distance flavour 
physics, including CKM parameters, CP violation and the search for new physics. By 
way of an outlook we mention here the important example of CP violation in B — ntr 
decay. The starting point for computing the required decay amplitudes is the effective 
Hamiltonian in (32). The needed matrix elements of the operators Q; can be analysed 
within QCD factorisation using (116). We will not go into the technical details of such 
an analysis and the discussion of limitations of the approach, in particular from power 
corrections in Agcp/mp. These can be found in (Beneke et al. 2001). Here we just want 
to present the phenomenological motivation and to illustrate that a theoretical approach 
towards a direct dynamical calculation of hadronic matrix elements will be very valuable, 
even if it necessitates some approximations. 


The observable of interest is the time-dependent CP asymmetry in the decays of B? 
and B? to ntr”; it is sensitive to the B4- B, mixing phase e 7?. We define 


AR (4) B(B?(t) — ntr) — B(B°(t) > ata) 
cP B(B°(t) + ntr) + B(B*(t) ^ n*n-) 
= —5,,sin(Amg t) + C, cos(Amg t) (168) 
where 
21mA 1- eal? ase + Pan |T, 
i Qa lA MELLE 
Se = DF ia OF TSP © e x Parf Tor (169) 


The amplitudes T,, (“tree”) and Pyr (“penguin”) are the components of the B — ntr 
amplitude corresponding to the terms in (32) involving A, and Àe, respectively. In the 
standard phase conventions A, is real and A, has a weak phase — y, which has been factored 
out above. The coefficient C,,, which is a function of y, represents direct CP violation 
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Figure 16. Relation between sin 2a and the mixing-induced CP asymmetry Srn, assuming 
sin 28 = 0.48. The dark band reflects parameter variations of the first kind, the light band 
shows the total theoretical uncertainty. The lower portion of the band refers to values 
45° < a < 135°, the upper one to 0 < a < 45° (right branch) or 135° < a < 180° — 8 
(left branch). 


and is expected to be small. We shall not discuss it further here. The mixing-induced 
asymmetry S,, depends on y and 2. In fact, in the limit where P, /T;., is set to zero it 
follows that Asr = e-?!0*» — e?* and hence Spr = sin 2a. In this limit A, is just the 
relative weak phase between the direct amplitude B — rtr” and the one with mixing 
B — B — «*r-. All dependence on hadronic input has cancelled in this situation. In 
practice, however, P, /T, is not fully negligible. It is here that information on hadronic 
dynamics becomes crucial. QCD factorisation predicts that P. /T,.,, is suppressed (either 
by a, or by powers of Agcn/my), because Tyr can arise at tree level, Pyn only through 
loops. Estimates within this framework give values of about 0.25-0.3. 


To illustrate the effect of penguin amplitudes, we first assume that |Va»/Væl| and the 
weak phase 8 have been determined accurately. Then using y = 180? —a— f the expression 
for Arz in (169) becomes a function of a and our prediction for the penguin-to-tree ratio 
Prix /Tar. If we further assume that the unitarity triangle lies in the upper half of the 
(p, 7) plane, then a measurement of S,, determines sin 2a with at most a two-fold discrete 
ambiguity. Figure 16 shows the relation between the two quantities for the particular case 
where |V,5/V.,| = 0.085 and 8 = 14.3°, corresponding to sin 28 = 0.48. The dark band 
shows the theoretical uncertainty due to input parameter variations, whereas the light 
band indicates the total theoretical uncertainty including estimates of the effect of power 
corrections. We observe that for negative values sin 2a as preferred by the global analysis 
of the unitarity triangle, a measurement of the coefficient S,,, could be used to determine 
sin 2a with a theoretical uncertainty of about +0.1. Interestingly, for such values of sin 2a 
the "penguin pollution" effect enhances the value of the mixing-induced CP asymmetry, 
yielding values of Syr between —0.5 and —1. Such a large asymmetry should be relatively 
easy to observe experimentally. 
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Figure 17. Allowed regions in the (6,7) plane corresponding to constant values of the 
mizing-induced asymmetry S,, assuming the Standard Model. The widths of the bands 
reflect the total theoretical uncertainty. The corresponding bands for positive values of 
San are obtained by a reflection about the p axis. The light circled area in the upper-right 
quadrant shows the allowed region obtained from the standard global fit of the unitarity 
triangle (Hócker et al. 2001). 


Although it illustrates nicely the effect of “penguin pollution" on the determination 
of sin 2a, Figure 16 is not the most appropriate way to display the constraint on the 
unitarity triangle implied by a measurement of S,,. In general, there is a four-fold discrete 
ambiguity in the determination of sin 2a, which we have reduced to a two-fold ambiguity 
by assuming that the triangle lies in the upper half-plane. Next, and more importantly, we 
have assumed that |V,,/V.4| and 8 are known with precision, whereas o is undetermined. 
However, in the Standard Model |V,,/V4j| and the angles a, B, are all functions of the 
Wolfenstein parameters p and 7. It is thus more appropriate to represent the constraint 
implied by a measurement of S,, as a band in the (f,7) plane. To this end, we write 


qno PT) 00s 9 -2m-p ^^ Pa rct (170) 
BENE ü-ae$ Co oT. pog 


where r,4 €/?** is independent of p and 7. We now insert these relations into (169) and 
draw contours of constant S,, in the (p,7) plane. The result is shown by the bands in 
Figure 17. The widths of the bands reflect the total theoretical uncertainty (including 
power corrections). For clarity we show only bands for negative values of Syr; those 
corresponding to positive S,, values can be obtained by a reflection about the p axis (i.e., 
7j + —7). Note that even a rough measurement of Spr would translate into a rather narrow 
band in the (p, 7) plane, which intersects the ring representing the |Vas/Væl| constraint at 
almost a right angle. In a similar way, the constraint is also quite robust against hadronic 
uncertainties. Even the approximate knowledge of hadronic matrix elements, as provided 
by QCD factorisation, will therefore be very valuable and can lead to powerful constraints 
on the Wolfenstein parameters. 
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7 Summary 


In these lectures we have discussed the theory of heavy quarks, focusing on the important 
case of B physics. (All methods relying on the heavy-quark limit could in principle be 
applied to charmed hadrons as well, but they are in most cases much less reliable due to 
the smaller value of the charm mass.) We shall conclude by summarising the key points. 


e A crucial and general idea for dealing with QCD effects is the factorisation of short- 
distance and long-distance dynamics. We have encountered this principle in many 
different forms and applications: 


— The OPE to construct the effective weak Hamiltonians (244! ?) factorises 


the short-distance scales of order My, m, from the scales of order mọ. 


— The heavy-quark scale m treated as a short-distance scale can be factorised 
further from the intrinsic long-distance scale of QCD, Agcp. This leads to a 
systematic expansion of observables simultaneously in 1/m and o,(m) with 
often very important simplifications. The precise formulation of this class of 
factorisation depends on the physical situation and can take the form of HQET, 
LEET, HQE or QCD factorisation in exclusive hadronic B decays. 


» HQET exhibits the spin-flavour symmetry of QCD in the heavy-quark limit, which 
allows us to relate different form factors, and makes the mg dependence explicit. 


Examples of typical applications are B > Dv or fe. 


e HQE is a theory for inclusive B decays. It justifies the “parton model” and allows 
us to study the nonperturbative power corrections. This is of great use for processes 
as B > X.,lv, B > X,y, B > X,UT., and the lifetimes of b-flavoured hadrons. 


e QCD factorisation, finally, refers to a framework for analysing exclusive hadronic B 
decays with a fast light meson as for instance B > Dm, B > mm, B > 7K and 
B 5 V. 


With these tools at hand we are in a good position to make full use of the rich experi- 
mental results in the physics of heavy flavours. We can determine fundamental parameters 
of the flavour sector, such as Vas, Va, Via/Vis, n and sin 2a, and probe electroweak dy- 
namics at the quantum level through b — sy or B - B mixing. This will enable us to 
thoroughly test the standard model and to learn about new structures and phenomena 
that are yet to be discovered. 
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1 Introduction 


Lattice QCD was invented, way ahead of its time, in 1974. It really became a useful 
technique in the 1990s when a huge amount of progress was made in the understanding 
and reduction of systematic errors. Now, we are poised to start a second lattice revolution 
with the onset of Teraflop supercomputing around the world and further improvements 
in methodology. This will enable calculations using lattice QCD to reach errors of a few 
percent, over the next five years. At this level, lattice results, where they exist, will be 
the theoretical calculations of choice for the experimental community. 


It seems, then, a good time to review the fundamentals of lattice QCD, for an audience 
of experimental particle physicists. As ‘consumers’ of lattice calculations, it is important 
to be aware of how these calculations are done so that a critical assessment. of different 
results can be made. I have tried to keep technical details to a minimum in what follows 
but it is necessary to understand some of them, to appreciate the significance and the 
limitations of the lattice results that you might want to use. For a more detailed discussion 
see, for example (Gupta, 1998) or (Di Pierro, 2001). This school is largely concerned with 
CP violation and heavy quark physics, so in Section 4 I concentrate on lattice results 
relevant to these areas. 


2 Lattice QCD formalism and methods 


2.1 The path integral approach 


Lattice QCD is just QCD, no more and no less. We take the theory, express it in Feynman 
Path Integral language, and calculate the integral as well as we can. We would like to 
be able to do this in the continuous space-time of the real world, but this is not possible. 
Instead, we must break space-time up into a 4-d grid of points, i.e. a lattice (Figure 1), 
and evaluate the Feynman Path Integral by Monte Carlo methods on a computer. It 
turns out to be a calculation that requires a huge amount of computing power and tests 
the fastest supercomputers that we have. 
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Figure 1. A 2-dimensional rendition of a 3-dimensional cubic lattice. Lattice QCD 
calculations use a 4-dimensional grid. 


In the Feynman Path Integral approach, we first express the quantity that we want to 
calculate as the matrix element in the vacuum of an operator, ©, which will be a product. 
of quark and gluon fields so that, for example, O = (vv),(Vv);. creates a hadron at a 
point x and destroys it at a point y. We will discuss later other forms that O might take 
to calculate useful quantities. Then: 


[iA lati [aA] Oto, V. Aje 


(0161) = — — (1) 
f (e (ae (aA de 
where S is the action, the integral of the Lagrangian: 
S= J Late. (2) 


We are using Euclidean space here (imaginary time) so that the integrand doesn’t contain 
the oscillatory €'?, but the more easily integrated e75. The integral of Equation 1 can 
then be evaluated numerically if we can convert it to a finite-dimensional problem. 


Currently the integral runs over all values of the quark and gluon fields and A at ev- 
ery point in space-time. We need to make the number of space-time points (and therefore 
field variables) finite and we do this by taking a 4-d box of space-time and discretising it 
into a cubic grid, or lattice. It is then a relatively simple matter to transcribe the contin- 
uous theory onto the lattice, and we use the standard methods used for diseretising c.g. 
differential equations for numerical solution. Continuous space-time (x,t) becomes a grid 
of labelled points, (1;,2;) or (n;a, nga) where a is the spacing between the points, called 
the lattice spacing. The fields are then associated only with the sites, y(x, t) > Y(n, n). 
The action must also be discretised, but this is also straightforward. The Lagrangian 
tvpically contains fields and derivatives of fields. The fields are replaced with fields at the 
lattice sites and the derivatives replaced with finite differences of these fields. The integral 
over space-time of the Lagrangian becomes a sum over all lattice sites: (f d'a > Y, a"). 
There are inevitably discretisation errors associated with this procedure (just as there 
are for differential equations) because the lattice Lagrangian only matches the continuum 
Lagrangian at a = 0. At non-zero a there are effectively additional unwanted terms in 
the lattice Lagrangian that are proportional to powers of a. We will discuss this further 
later. Another view of the lattice is that it provides an ultra-violet cut-off on the theory 
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in momentum space, since no momenta larger than 7/a make sense (the wavelength is 
then smaller than a). In this way it is an alternative regularisation of QCD. 


As an illustration of the simplicity of the discretisation procedure, let us consider a 
scalar field theory with Lagrangian 


L= 5 (0.6 + sme + Af. (3) 


The lattice action, S is then 


2 


s-yw EDI + Sry aeta). (4) 


p=l 2 


The point n+ 1, is one lattice point up from the point n in the p direction. We are always 
free to rescale parameters and fields and we do this on the lattice, rescaling by powers 
of the lattice spacing, so that the parameters and fields we work with are dimensionless. 
Everything is then said to be in ‘lattice units’. In the scalar theory above we rescale to 
primed quantities where ¢' = $a, m' = ma, X = A. Then 


S-2y (sw |2 + jn" + ro" - iX ¢'(n + 1,)¢'(n — y) (5) 


The rescaling has the effect of removing the lattice spacing explicitly from the action. 
A lattice calculation is done then without input of any value for the lattice spacing, or 
even knowing what it is. We will discuss later converting results back from lattice units 
to physical units, so that we can compare results to the real world. Equation 5 has 
in addition been rearranged to collect similar lattice terms together, using 55, to move 
the space-time indices. It now looks very like a spin model, revealing a deep connection 
between lattice field theory and the statistical mechanics of spin systems. 


2.2 Lattice gauge theories for gluons 


To discretise gauge theories such as QCD onto a lattice requires a little additional thought 
because of the paramount importance of local gauge invariance. The róle of the gluon 
(gauge) field in QCD is to transport colour from one place to another so that we can 
rotate our colour basis locally. It should then seem natural for the gluon fields to ‘live’ 
on the links connecting lattice points, if the quark fields ‘live’ on the sites. 


The gluon field is also expressed somewhat differently on the lattice to the continuum. 
The continuum A, is an 8-dimensional vector, understood as a product of coefficients A? 
times the 8 matrices, 75, which are generators of the SU(3) gauge group for QCD. On the 
lattice it is more useful to take the gluon field on each link to be a member of the gauge 
group itself i.e. a special (determinant — 1) unitary 3 x 3 matrix. The lattice gluon field 
is denoted U,(n;, ne), where u denotes the direction of the link, nj, n, refer to the lattice 
point at the beginning of the link, and the color indices are suppressed. We will often 
just revert to continuum notation for space-time, as in U,(z). The lattice and continuum 
fields are then related exponentially, 


U, = es^. (6) 
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Figure 2. The gluon field on the lattice. 


where the a in the exponent makes it dimensionless, and we include the coupling, g, for 
convenience. If U, (x) is the gluon field connecting the points z and z + 1, (see Figure 2), 
then the gluon field connecting these same points but in the downwards direction must 
be the inverse of this matrix, U;'(x). Since the U fields are unitary matrices, satisfying 
U'U — 1, this is then Uf(z). 

This form for the gluon field makes it possible to maintain exact local gauge invariance 
on a lattice. To apply a gauge transformation to a set of gluon fields we must specify 
an SU(3) gauge transformation matrix at each point. Call this G(x). Then the gluon 
field U,(x) simply gauge transforms by the (matrix) multiplication of the appropriate G 
at both ends of its link. The quark field (a 3-dimensional colour vector) transforms by 
multiplication by G at its site. 


U(9 (a) = G(x)U,(x)G! (x + 1,) 
y(x) = G(x)b(2) 
BP (2) = wW(z)G'(z). (7) 


To understand how this relates to continuum gauge transformations try the exercise of set- 
ting G(x) to a simple U(1) transformation, e'*(29, and show that Equation 7 is equivalent 
to the QED-like gauge transformation in the continuum, A’ = A, — Ó,o. 


unm 


Figure 3. A string of gluon fields connecting quark and antiquark fields (left) and a 
closed loop of gluon fields (right). 


Gauge-invariant objects can easily be made on the lattice out of closed loops of gluon 
fields or strings of gluon fields (Figure 3) with a quark field at one end and an anti- 
quark field at the other, e.g. v(zi)U,(zi)U, (zi + 1,) -- -Uz2 — 1) (x2). Under a gauge 
transformation the G matrix at the beginning of one link ‘eats’ the Gt at the end of 
the previous link, since GiG = 1. The G matrices at x, and zr; are ‘eaten’ by those 
transforming the quark and anti-quark fields, if we sum over quark and antiquark colors. 
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The same thing happens for any closed loop of Us, provided that we take a trace over 
color indices. Then the G at the beginning of the loop and the Gf at the end of the 
loop, the same point for a closed loop, can ‘eat’ each other. (Try this as an exercise, re- 
membering that U fields going in the downward direction are U's and, from Equation 7, 
ULO (x) =G(r+ 1,)U{(x)G*(z).) 

The purely gluonic piece of the continuum QCD action is 


Some = | dag Te Fy F™ (8) 


and the simplest lattice discretisation of this is the so-called Wilson plaquette action: 


1 6 
S« = 8X (1— gRe(TrUp})s 8-75 (9) 


U, is the closed 1 x 1 loop called the plaquette, an SU(3) matrix formed by multiplying 


X 


Figure 4. A plaquette on the lattice. 


4 gluon links together in a sequence. For the plaquette with corner z in the 7, j plane we 
have (Figure 4): 
U,(z) = U;(z)U;(z + 1,)U (x + 1;)U} (z) (10) 


Tr in Siau denotes taking the trace of Up i.e. the sum of the 3 diagonal elements. Si 
sums over all plaquettes of all orientations on the lattice. 8 is a more convenient version 
for the lattice of the QCD bare coupling constant, g?. This is the single input parameter 
for a QCD calculation (whether on the lattice or not) involving only gluon fields. Notice 
that the lattice spacing is not explicit anywhere, and we do not know its value until after 
the calculation. The value of the lattice spacing depends on the bare coupling constant. 
Typical values of @ for current lattice calculations using the Wilson plaquette action are 
B x 6. This corresponds to a zz 0.1fm. Smaller values of @ give coarser lattices, larger 
ones, finer lattices. Other improved discretisations of the gluon action are also used. In 
these the bare coupling constant appears in a different way and so comparison of the 
bare coupling constant between different gluon lattice actions is meaningless. The only 
comparison which makes sense is that of the resulting values for the lattice spacing. That 
Stat, of Equation 9 is a discretisation of Scont is not obvious, and we will not demonstrate 
it here. It should be clear, however, from Equations 6 and 10 that Si does contain terms 
of the form 0,,A,. 


Sy is gauge-invariant, as will be clear from our earlier discussion. Thus lattice QCD 
calculations do not require gauge fixing or any discussion of different gauges or ghost 
terms. We simply calculate the appropriate Feynman Path Integral using Shau. Since we 
are only describing calculations for gluons at this stage, O will be some gauge-invariant 
product of U fields, for example the closed loop of Figure 3. Such a calculation is fully 
non-perturbative since the Feynman Path Integral includes all possible interactions in 
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the matrix element that we are evaluating. In contrast to the real world, however, the 
calculations are done with a non-zero value of the lattice spacing and a non-infinite volume. 
In principle we must take a — 0 and V — oo by extrapolation. In practice it suffices to 
demonstrate, with calculations at several values of a and V, that the a and V dependence 
of our results is small, and understood, and include a systematic error for this in our 
result. 


2.3 Algorithms 


The Feynman Path Integral (Equation 1) for gluons only becomes 


f [dU] Oe- 5 
Tage > (11) 


To evaluate this integral we can generate random sets of U fields on the lattice and work 


out the result: 
Z O,e 5 
(0/OJ0) = Se (12) 
{U}, is a set of U matrices, one for each link of the lattice, and is called a configuration. 
O, is the value of O on that configuration (e.g. the trace of a closed loop of Us). A set 
of configurations is an ensemble. 


(0/010) = 


This is a very inefficient way of working. If S, is large for a particular configuration 
it contributes very little to the result. Instead it is better to generate the configurations 
with probability e75. This is called ‘importance sampling’ since we preferentially choose 
configurations with a large contribution to the integral. If we have a set of configurations 
so distributed then 


(lol) = (0) = x È On (13) 


i.e. the result simply becomes the ensemble average of the value of the operator © evalu- 
ated on each configuration. The calculation then has a statistical uncertainty associated 
with it, which varies with the ensemble size, N, as 1/ VN. 


Several algorithms exist to generate an ensemble of configurations with distribution 
e75. The Metropolis algorithm is the earliest and simplest, but shares several features with 
later more sophisticated algorithms. The first step is to generate a starting configuration, 
(Ui, e.g. by setting all the U matrices to the unit 3 x 3 matrix or by generating random 
SU(3) matrices. The algorithm then sweeps round the configuration, one U matrix at a 
time. For each U matrix a small change is proposed, i.e. a random matrix close to the 
unit matrix is generated which could multiply U. The change in S is calculated if this 
change to U were to happen. If S is reduced, the change is accepted; if not, it is accepted 
with probability e^ ^* (by comparing e-^* to a random number between 0 and 1). Once 
this has been done for every U € {U}, we have a new configuration, (U)5. We then 
repeat to obtain {U }; etc. Once we have an ensemble we can do any number of different 
calculations (often called ‘measurements’) on it for different operators O. Ensembles are 
the equivalent of experimental data sets created by collaborations of theorists. They are 
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often stored for years and re-used many times. Some ensembles are publicly available - 
see http://qcd.nersc.gov/ and http://www.ph.ed.ac.uk/ukqed/. 


An important point to note is that each member of an ensemble is generated from a 
previous member. The ensemble therefore has a (computer) time history. We have to 
worry about the ‘equilibration time’ and the ‘decorrelation’ (autocorrelation) time of the 
ensemble. The equilibration time is the number of sweeps required to reach a configuration 
typical of the distribution e~* that we are trying to create, i.e a configuration which has 
‘forgotten’ the starting configuration. The autocorrelation time is the number of sweeps 
it takes to generate a sufficiently different configuration that results can be considered 
statistically independent. The autocorrelation time can be determined from the sequence 
of results for O and will depend on ©. In general if O is an operator with large extent, 
e.g. a closed loop of U fields over many lattice sites, it will have a longer autocorrelation 
time than if © is a small loop. This is because the changes to a configuration spread 
out randomly from a point, one step per sweep. As we try to reach smaller values of 
a, closer to the continuous space-time of the real world, we expect a phenomenon called 
‘critical slowing-down’. This is because a given physical distance, say the size of a hadron, 
takes up many more lattice sites as a gets smaller. For an ensemble to decorrelate on 
this physical distance scale then requires more sweeps. This makes the numerical cost 
of reducing the lattice spacing at fixed physical volume far worse than the naive a* (see 
Figure 5). 


| 
E 


| 
E 


Figure 5. A given physical distance requires more lattice points to cover it as a 1s reduced. 


2.4 Quarks on the lattice 
2.4.1 The fermion doubling problem 


The inclusion of quarks in the lattice QCD action causes several difficulties related to 
their fermionic nature and makes lattice QCD calculations very costly in computer time. 


The so-called ‘fermion doubling’ problem is apparent even for free quarks, in the 
absence of any interaction with the gluon field. The continuum action for a single flavor 
of free fermions is 


S, = f dtc d(y"d, + my. (14) 
The obvious (so-called naive) lattice discretisation gives 


4 
: z py = 
onaane =a! ` E ) au etn Vect Tomis. (15) 
T pal 
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Figure 6. The doubling problem for lattice fermions. The sine curve shows the lattice 
quark inverse propagator in 1-d. The straight lines through p = 0 (solid) and through 
p — n/a (dotted) are those for a continuum quark. 


The problems become evident when we Fourier transform this and compare the lattice 
inverse propagator: 
sin p,a 


Gatt naive (D) = [27 a +m (16) 


to that obtained in the continuum from Equation 14, 


Gas (P) = (yup, + m. (17) 
The two are plotted for a massless quark in one-dimension in Figure 6 over one lattice 
Brillouin zone (momenta beyond +7/a are equivalent to those in this range). The lattice 
result looks continuum-like around p z 0, where the inverse propagator is close to zero. 
The lattice inverse propagator is also close to zero around p œ a/a, however. Since 
n/a and —7/a are periodically connected on the lattice, another continuum-like line can 
be drawn at this point (with opposite slope to the one at the origin). Thus in one- 
dimension, our lattice fermion contains two continuum-like fermions rather than one! On 
a 4-dimensional lattice we have 2^ fermions instead of one. The 15 excess fermions are 
called doublers. The doubling problem is clearly a consequence of the fact that the sine 
function appears in Equation 16 and this is because of the single derivatives in the Dirac 
action for a relativistic fermion, Equation 14. For a scalar particle (Equation 5) we would 
have a cosine instead, and no difficulty. 


2.4.2 Wilson quarks 


There are several approaches to the doubling problem. The most severe in terms of its 
effects, but currently the most popular for a lot of applications, is the Wilson quark action. 
In this the doublers are entirely removed, by adding a ‘Wilson’ term to the action which 
gives them a much larger mass than ma, so that they drop out of the physics. The term 
added is a double derivative so appears with an extra power of a (a5) in order to have the 
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same dimensions as the other terms in S?" (Equation 15): 


s" _ greve _ 52° Y p Ote, (18) 
4 — 
Ow, = > rt, Us + Vezin (19) 
u-l a 


where r is the Wilson parameter (almost always set to 1). The extra power of a in 
Equation 18 means that the correspondence between the lattice and continuum actions 
as a — 0 is not changed. However, if we look at the inverse propagator again, there is a 
difference. 


- - rÁ. 
Gw (p) = Graive + 75 » sin^(p,a/2). (20) 
H= 


If we substitute for G1, from Equation 16 and expand out the sin function around p z 0 
we get 
ra £ 
Gy (p) = Vy,py + M + 3 Nx. (21) 
v-l 

Comparing this to the continuum form (Equation 17) as a > 0, the r term will disappear 
and a fermion of mass m will have the right form. If instead we look at the doublers, we 
must expand around p = 7/a. If we call p the momentum difference between p and x/a 
and consider the case where p has only one component close to 7/a, and the others are 
close to zero, then 


~ "n 2 
GHD) = ul, t ma Tees (22) 


Now as a — 0, the mass of the doubler, m 4- 2r/a — oo. The doublers at other corners of 
the Brillouin zone pick up masses of 4r/a, 6r/a, 8r/a : check this as an exercise. Thus 
we are assured that our quark action describes only the one quark that we intended, but 
there is a price for this, as we shall see below. 


The Wilson quark action is converted to dimensionless units by a rescaling ay — v, 
leaving the quark mass parameter as a mass in lattice units, ma (previously called m’). 


sy = 2 fa. » [o = T)Uz4i, = (Yy + r)a] + (ma + ieva) : (23) 


It is conventional to define a ‘hopping parameter’ called x which is 1/(2ma + 8r) and so 
1/« plays the rôle of the quark mass. 4 is conventionally rescaled by V2« so that x moves 
to multiply the terms connecting the V field on different sites (thus allowing ‘hops’). If 
we now couple in a gluon field, the w field will become a 3(color) x4(spin) dimensional 
vector on each site. The gluon field must be included in such a way as to keep the action 
gauge-invariant. From our earlier discussion it is then obvious that U matrices must be 
inserted as a link between the 7 and v) fields when they are on neighbouring sites. The 
Wilson quark action is then conventionally written: 


Way t » d. — DU ea, — Ber, (e+ news " $n] O (24) 


T 
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The price we pay for using the Wilson quark action is that we break explicitly the chiral 
symmetry of continuum QCD. This is a symmetry of the derivative terms in Sy (Equa- 
tion 14) which allows us to rotate separately right- and left-handed components of the 
quark field. T'he spontaneous breaking of this symmetry gives us à massless pseudoscalar 
meson called the pion as a Goldstone boson and has other important consequences for 
particle physics. Chiral symmetry is broken explicitly by a quark mass (so that the real 
pion is not actually massless) but also, more seriously for the lattice, by the Wilson term. 
As a — 0, chiral symmetry will be recovered, but for real lattice calculations at non-zero a, 
the lack of chiral symmetry can cause difficulties for some calculations. 


One surprising feature of Wilson quarks is that it is still possible to get a massless 
pion even at non-zero a, when chiral symmetry is broken. However, we have to search 
for the value of 1/« at which it occurs—it is not simply the point 1/« = 8r, as it would 
be in the free theory, above. Lattice calculations of the mass of the pseudoscalar meson 
(Mps) must be done at various input values of « (see Section 3) for a given ensemble. 
A plot of Mg, against 1/« is then extrapolated to the point where Mps is zero. The 
value of « at this point is called Keritica and is the point at which the bare quark mass 
in the interacting theory is zero (but matrix elements will not necessarily show chirally 
symmetric behaviour). The bare quark mass in lattice units, ma, at other values of « can 
then be taken to be (1/2« — 1/2kcritical). 


Another problem for the Wilson quark action is the presence of large discretisation 
errors. The naive quark action has discretisation errors proportional (at lowest power) to 
a? because (see Equations 16 and 17) sin(pa)/a = p(1—p’a?/6+---). In the measurement 
of a hadron mass, the terms proportional to p?a? in the action will induce an error pro- 
portional to A?a? where A is some typical momentum scale inside the hadron in question, 
say 300MeV. For lattice spacing values we can reach, around 0.1fm (= (2GeV) ! when 
h = c = 1), this gives an expected error of order 2%. The Wilson term (Equation 18) 
that we added, however, is proportional to a, so that S = Sp" + O(a). Now hadron 
masses will have an error of typical size Aa, which could be 15% at a = 0.1fm. One can 
extrapolate this error away by doing calculations at several values of a but the size of the 
extrapolation adds uncertainty. 


Instead, we can ‘improve’ the quark action, by adding additional terms to counteract 
the errors at any order in a. This is equivalent to a higher order discretisation scheme for 
differential equations. For the Wilson quark action we can add the so-called clover term, 
making the clover, or Sheikholeslami- Wohlerti, action: 


1 Cs, KT 


4 


The standard discretisation of aF,, is as a set of 4 plaquettes arranged in a clover- 
leaf shape. If the clover coefficient, c,,, is chosen correctly then the clover action has 
leading order errors proportional to a? again. It is in the correct choice of this coefficient 
that the difficulties of discretising a field theory, as opposed to a standard differential 
equation, appear. We are trying to match QCD with an ultraviolet momentum cut- 
off of 1/a to QCD with an infinite momentum cut-off. Gluonic interactions with gluon 
momenta between 7/a and oo in the continuum must be accounted for on the lattice by 
a renormalisation of coefficients in the action. Thus the naive (tree-level) value of 1 for 
Csw is renormalised by an amount which depends on the QCD coupling constant at some 
momentum scale around 7/a. This momentum scale is typically quite large (for a = 0.1fm 


Sever — Sr — D OR T (25) 
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it is 6GeV) so that a perturbative calculation of cy, can work well. c, = 1 - cios (n/a) + 
c02(r/a) +--+. In fact it has been shown that a lot of the perturbative correction 
can be absorbed into a renormalisation of the U field by a factor called uo, and this is 
called tadpole-improvement (Lepage, 1993). Alternatively c,« can be determined within 
the lattice calculation itself (i.e. non-perturbatively) by insisting that some continuum 
relationship, broken by the discretisation errors, works on the lattice (Sommer, 1998). For 
Csw We can impose Ward identities from chiral symmetry, for example. This improvement 
programme for the lattice action can be carried further at the cost of introducing more 
coefficients that have to be determined by a match to continuum QCD. However, this 
must be compared to the cost of not improving the action, which requires calculations on 
very fine lattices to achieve small enough discretisation errors for the accuracy we require 
and is generally prohibitive. 


2.4.3 Staggered quarks 


Here we return to the naive quark action and ask, what was so bad about having 16 
quarks instead of 1? If we had 16 flavors of quarks of the same mass in Nature, the 
naive action might be fine. In fact we only have two quarks that might be considered 
degenerate, u and d. They both have masses of a few MeV. Although we do not believe 
that their masses are the same, the difference is much smaller than any other mass, and 
they are treated as degenerate in most lattice calculations at present. 


We can ‘thin’ the degrees of freedom of the naive lattice quark action by removing the 
4 spin degrees of freedom (which can be shown to be multiple copies of the same thing). 
The quark field, X, then becomes a 3(colors) x 1(spin) component object on a site and the 
staggered (Kogut-Susskind) fermion action is: 


- 1 
Sf = E3 SJE ma (Ulan, ue ipxa)emes]. — 0) 


Nz is +1 according to the formula nz, „ = (—1)* where k = Y,,.,, £v. This action describes 
16/4 = 4 quarks, now much closer to the real world, if we want to interpret the doublers 
as flavors. We might hope that if the 4 flavors do behave as 4 copies of the same thing 
we can reduce their effect by a factor of two or four (depending on how many degenerate 
flavors we want to simulate) by multiplication with the required factor at appropriate 
points (as we could in QCD perturbation theory). The 4 spin degrees of freedom for the 
4 flavors are made from the 16 components of the X field on a 2* hypercube, which is a 
complication if we need to separate out the flavors. The staggered action, however, has a 
remnant of chiral symmetry which ensures the very desirable feature that the quark mass 
(and the associated Goldstone boson pion mass) vanish at ma — 0. This behaviour gives 
the added benefit of making staggered quarks rather better behaved and computationally 
much faster to work with than Wilson-type quarks. 


The down-side of staggered quarks is again the discretisation errors. These are formally 
O(a?), just as for naive quarks, but some of the errors induce flavor-changing interactions 
and so are rather dangerous. In practice they produce a larger than expected effect 
for simple a? errors. A quark with momentum around 0 can be scattered to one with 
momentum around 7/a i.e a doubler, and therefore a different flavor, by the interaction 
of Figure 7. One of the results of this is that the 16 different pions (for 4 flavors) no 
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Figure 7. A flavor-changing interaction for staggered quarks on the lattice. 


longer have the same mass and only one of them has a mass which vanishes as ma — 0. 
Improvement terms have recently been developed which can be added to the action to 
reduce these interactions to a much lower level, and the masses of the different pions 
are then much closer together (Bernard, 2001, MILC collaboration). This makes the 
prospects for working with staggered quarks in lattice QCD calculations much better, 
and a lot more work with these quarks will certainly be done. 


2.4.4 Ginsparg-Wilson quarks 


A recent development has been a set of quark actions which maintain chiral symmetry of 
the action while still describing only one quark flavor, but at the cost of a very complicated 
lattice discretisation of the continuum derivative. This is then costly to implement. For 
example, in the domain-wall formulation an additional 5th dimension is required whose 
length, in principle, must go to infinity. A lot of work is being done to develop algorithms 
for these quark actions which may make them feasible in the long-term. In the meanwhile, 
they are already being used for calculations that really need chiral symmetry at finite 
lattice spacing, such as that of the CP-violating parameter in the K system, e’. 


2.5 Algorithms for quarks 
Another problem with handling quarks in lattice QCD is that they are fermions, obeying 


the Pauli Exclusion Principle, and therefore cannot be represented by ordinary numbers 
in a computer. We must do the quark functional integral by hand: 


f [dU] [dv] [di] e- 9*9 Mv = f [dU] det Me- 5 (27) 
where the form for the matrix M depends on the quark formulation and can be derived 


from the forms given above for the quark action (Equations 24, 25 and 26). The QCD 
action then becomes 


s= Ds ( E xe Tr (U;)) — hi (det M). (28) 


We now generate ensembles of gluon fields (only) with importance sampling based on this 
action. The standard algorithm for doing this is called Hybrid Monte Carlo . The second 
term is a very expensive one to include, because it requires frequent calculations of M^! 
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(various algorithms, such as Conjugate Gradient exist to do this) and M is a large matrix 
(4(forWilson) x 3 x V zz 2 x 10$ on a side). If this term is missed out for expediency 
(so that the action is just 5,) then we talk of using the ‘quenched approximation’. Most 
calculations in the past have been quenched (and most of the results I discuss later 
will be in the quenched approximation) but recently calculations using the full QCD 
action (‘unquenched’ or ‘with dynamical/sea quarks’) have been attempted and in the 
future we hope that the quenched approximation will become redundant. We can think 
of the In(det M) term as giving rise to a sea of quark/anti-quark pairs appearing and 
disappearing in the vacuum. For every quark flavor for which we have a separate matrix 
M we should in principle include a term of the form In(det M) in the dynamical quark 
action. However, it is only the production of light (u, d, s) quark/anti-quark pairs that we 
envisage having a significant effect for most of the quantities that we calculate. Dynamical 
lattice calculations are then done with N; = 2 for u, d dynamical quarks or 2+1 if s is 
included. 


Quarks must also be integrated out of the operators, ©. For O = (VV),(V),, the 
form mentioned earlier, which creates a meson at the point x and destroys it at the point 
y, then 


[ovana egee = fray Mz (Uy QM; qu] detMe-*. (29) 


M~' is the quark propagator from x to y on a given gluon configuration, obtained by 
solving Mz = b where b is a vector with a 1 at x (and a certain color and spin index) 
and 0 everywhere else. We have been explicit here about the flavor indices, which we 
have taken as u and d, although lattice calculations usually then assume that u and d 
are degenerate and therefore the two M^! factors are the same. However, if the hadron 
actually does contain two quarks of the same flavor then ‘disconnected’ pieces containing 
Mz 1 will appear, as well as the ‘connected’ pieces above. The color indices, a and b, are 
also explicit (and summed over) and make © gauge-invariant. The sums over spin indices 
have not been made explicit because in this case they follow the color indices (but see 
Section 2.6). On an importance-sampled ensemble (either quenched or unquenched) for 
this example we then have to calculate Trcoior spin (Mz 7) (Mr?) on every configuration 
and average over configurations. 


Calculating M^! is computationally expensive and gets harder as M develops small 
eigenvalues, which happens as ma — 0 (for staggered quarks) or x — Keri (for Wilson or 
clover quarks). Thus, even in the quenched approximation, we cannot actually calculate 
with quark masses close to those of real u and d quarks. Instead we work with heavier 
quarks and perform so-called chiral extrapolations to the chiral limit where u and d quarks 
would be (almost) massless. 


2.6 Relating lattice results to physics 


Above we have given an example for ©, which includes the creation of a valence quark 
and anti-quark at the point x and their destruction at the point y. This is a so-called 
hadron correlator or 2-point function on the lattice since it simply has a source and a 
sink, and is one of the simplest quantities to calculate. It is shown pictorially at the left 
of Figure 8, where the solid lines indicate the valence quark propagators, and the blobs 
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Figure 8. A graphical representation of two types of 2-point functions calculated on the 
lattice. Left, for a hadron mass calculation; right, for a decay constant. 


at the two ends indicate the creation and annihilation of the meson. A baryon would of 
course have 3 valence quark propagator lines. Usually we project. onto specific values of 
p for the hadron, so in Figure 8 we have suppressed spatial indices at the source and sink 
and just refer to the time index, 0 at the source and T' at the sink. The figure shows 
how, as the valence quarks propagate, they interact any number of times by exchange of 
gluons. This is a pictorial representation of the fully non-perturbative nature of a lattice 
QCD calculation. The interactions include the production of dynamical quark/anti-quark 
pairs if a dynamical calculation is being done. 


'The calculation of this 2-point function will enable the extraction of the hadron mass 
(see below), for the hadron corresponding to the JPO quantum numbers of O. We make 
different quantum numbers by inserting y matrices between the v» and v fields in each 
piece of ©. For example, (51), creates a pseudoscalar meson (such as 7) and v3; 
a vector (such as p). When the quark functional integral is done, as in Equation 29, y 
matrices will appear between the two M^! factors and appropriate sums over spin indices 
will have to be done. 


The blobs in Figure 8 indicate that we can use more complicated forms for O for a 
given hadron, e.g. the ~ and v fields do not both need to taken at the point x. We 
can separate them spatially, either by inserting U fields to keep O gauge-invariant, or 
by fixing a gauge to allow spatial separation without including U fields. This enables 
us to feed in information, or prejudice, about the relative spatial distribution of the 
quarks in the hadron, i.e. its ‘wavefunction’. Each piece of Ø takes the form v, ,,ó(r)v, 
(suppressing the U fields) where ¢ is some function of the separation between v and v: 
it is known as the ‘smearing’ function and © is then a smeared operator. When the 
quark functional integral is done, factors of ¢ will appear between the M^! factors. The 
factor of $ is absorbed at the source by solving Mz = ¢ for, say, the quark (making a 
‘smeared quark propagator’) and Mz = 6 for the anti-quark (a ‘local quark propagator’). 
The two propagators are then put together with an explicit insertion of $ at the sink. 
Often calculations measure separately hadron correlators with several different smearing 
functions at both source and sink, enabling a more precise determination of the hadron 
mass. 


Another type of 2-point function is shown on the right of Figure 8. In this case we 
create the hadron with a smeared operator and destroy it with a local operator. This is a 
‘smeared-local’ or ‘smeared-current’ correlator, since the quantity that we can extract from 
this is the matrix element of the appropriate current operator, J, between the vacuum and 
the hadron. For example, this is used to calculate the decay constant, f,, related to the 
vacuum to 7 matrix element of the axial vector current (denoted by its time component, 
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Ao, in Figure 8). This couples to the W particle and mediates the purely leptonic decay 
of a 7 meson. See the Lagrangian for the weak interactions in (Rosner, 2002), but note 
that the W particle is not included explicitly in lattice QCD calculations. © in this 
case then takes the form (Vrir Olr Ye (Uo s). where the first factor creates the pion 
with a smeared operator at x and the second destroys it with the time component of the 
local axial vector current. The quark functional integral converts this to the same type 
of quantity, with two factors of M^!, that we discussed above. 


J - Vo. Vi 


0 t T 


Figure 9. A graphical representation of a 3-point function (for semileptonic decay) 
calculated on the lattice. 


Figure 9 shows a lattice 3-point function appropriate to the semi-leptonic decay of 
a hadron. One of the valence quark lines emits a W and chauges to a different flavor. 
A new hadron is then formed with the spectator quark. The emission of the W can be 
represented by the insertion of a current on one of the valence quark lines. The Figure 
shows a vector current (with temporal component Vg and spatial component Vi) which 
contributes to the decay of a pseudoscalar meson to a pseudoscalar meson (e.g. B — D). 
We then have a (smeared) source and sink at 0 and T, and a (local) current insertion at f, 
ie. 3 points. When the quark functional integral is done there will be 3 factors of M^, 
one for the original valence quark which decays (from 0 to t), one for the final valence 
quark (from ¢ to T) and one for the spectator (from 0 to T). In fact tlie most efficient 
way to do this calculation is to solve for the final valence quark propagator from T to t, 
taking as a source the spectator quark propagator from 0 to T. 


3 Lattice QCD calculations 


3.1 The steps of a typical lattice calculation 


Step 1 


A volume and a rough lattice spacing are chosen. A volume of (3fm)? is considered to 
be large enough not to 'squeeze', and therefore distort, typical hadrons placed on it. 
The time extent is usually taken as twice the spatial size since masses etc are extracted 
from the time dependence of hadron correlators (see below). The selection of the lattice 
spacing is a trade-off between getting close to the continuum limit (and therefore small 
discretisation errors) and the cost of the calculation, which grows as some large power of 
a~'. Improvement of the action, discussed above, helps here by giving small discretisation 
errors on coarser lattices. Lattice spacings around 0.1fm are reasonable on both counts. 
From experience we know roughly what value of the bare QCD coupling constant to take 
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in the gluon part of the QCD action to achieve various values of a (determined after the 
calculation, see below). However, the quark contribution to the action affects this also, 
and we have much less experience with this. A (3fm)? x 6fm lattice with a zz 0.1fm 
requires (30)? x 60 sites. 


Step 2 


À quark formulation, number of quark flavors, and masses in lattice units, ma, are chosen 
for the quark part of the QCD action. Again we have a trade-off between trying to take 
realistically small masses for the u and d quarks, and the cost. Again we do not know 
what the quark mass actually is until after the calculation, when we have calculated the 
masses of hadrons containing that quark. Recent calculations have been able to take 
dynamical quark masses down to the s quark mass and some have gone further; future 
calculations need to reach much smaller masses than this. Extrapolations to u and d 
quark masses will continue to be necessary, however (see step 8). Some interpolation will 
always be necessary too since the masses chosen will inevitably not be exactly correct, 
e.g. for the physical strange quark mass. 


Step 3 


An ensemble of gluon configurations must then be generated using importance sampling 


with e75. As discussed above, dynamical quarks appear implicitly through the quark 
determinant. 
Step 4 


Quark propagators are calculated on each gluon configuration of the ensemble by inverting 
the quark matrix, M, to make the 'valence' quarks inside the hadron. Where they are 
supposed to have the same flavor as the dynamical quarks, they should have the same mass 
in lattice units, ma. However, we can also calculate valence quark propagators for quarks 
with different mass from the dynamical quarks, and perform separate extrapolations in 
valence and dynamical quark masses. This is sometimes useful and particularly so if there 
is a very limited set of dynamical quark masses. It is known as the partially quenched 
approximation (PQA). 

Step 5 


The quark propagators are then put together in various combinations to form hadron 
correlators (see the discussion of the form taken for operators, ©, above) which are 
then averaged over all the configurations in the ensemble. We are concentrating here on 
operators O which are related to quark-based hadrons but gluonic operators can also be 
measured on the ensemble and averaged in the same way. 


Step 6 


The hadron correlators are fitted to their expected theoretical form to extract hadron 
masses and matrix elements. For the 2-point function described for the spectrum, the 
ensemble average of the product of smeared quark propagators described above gives us 
the vacuum expectation value of a hadron correlator, (0|H*(T) H (0)|0) (see Equation 1). 
The hadron creation(destruction) operator can create(destroy) from the vacuum all the 
hadron states which have the same JPC quantum numbers as the operator. For example, 
if the operator has the quantum numbers of a pseudoscalar meson containing u and d 
quarks, the 7 and all its radial excitations can be created(destroyed). The amplitude, A, 
with which a particular state is created or destroyed depends on the overlap with that 
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Figure 10. The effective mass of a 177 bb (Y ) correlator, calculated from a lattice 2-point 
function with local source and sink. 


state of the operator used, i.e. the smearing function, at source or sink. Thus we obtain 


(our (m) (0)|0) = Y; deese er (30) 


where the factor e^ arises because the two hadron operators are offset by a time dis- 


tance T in Euclidean space, and E, is the energy of the nth state. The states which 
dominate the fit, especially at large values of T', are those with lowest energy; if a projec- 
tion on zero momentum has been done, these will be the states with lowest mass. Often 
we are interested in the one state with lowest mass, the ground state (the 7 in the example 
above), and then try to design a good smearing function to have large overlap with that 
state, and very small overlap with its radial excitations. In that case fits can be done in 
which data at small values of T are thrown away and only a single exponential is used in 
the fit. The extent to which this works can be gauged by plotting the ‘effective mass’, the 
log of the correlator at time ¢ divided by t. If one state completely dominates the fit, a 
constant result is obtained as a function of ¢ - the effective mass is said to ‘plateau’. The 
plateau value is the ground state mass. Figure 10 shows the result for the effective mass 
of the correlator for an Y particle (see Section 4) calculated on the lattice. À clear plateau 
is seen but only for t > 15. For smaller ¢ the correlator clearly contains excitations of 
higher mass, because no smearing was used in this case. 


The best calculations use several different smearing functions at source and sink and 
perform simultaneous multi-exponential fits of the type in Equation 30. If the masses 
of several states can be obtained from the fit the reliability of the ground state mass is 
increased. It should also be pointed out that correlated fitting techniques must be used 
since the correlators at adjacent times are not statistically independent of each other. 


For the 2-point function used to calculate decay constants, the amplitude with which 
the hadron is destroyed at the sink is the vacuum to hadron matrix element of the current. 


OEO = 3; Aes) rar (3) 


n 
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and (O0|Ao|r(p = 0)) = frm,. To isolate the part proportional to the decay constant 
requires dividing the total amplitude of the ground state exponential by Agonag.s,. This 
can be obtained from a fit of the type in Equation 30, if the same smearing function is 
used at source and sink so that Agron = Asa. 

For the 3-point function we have two sets of hadrons with different flavor quarks, 
separated by a current insertion. 


Asre,nAsn m m J n) . - — 
(ucro = ZE feme eeo o (m 


where n runs over hadrons with the quantum numbers of the operator at 0 and m, those 
of the operator at T. Again the matrix element of interest, that of the current between 
two hadrons (usually for the ground states in the two cases), can be obtained by dividing 
out the amplitudes at source and sink from two separate 2-point fits for the two different 
hadrons of the kind in Equation 30. 


Step 7 


It is now possible to determine what the lattice spacing was in the simulation. This then 
sets the single dimensionful scale so that everything can be converted to physical units 
(GeV) from lattice units. The lattice spacing is determined by requiring one dimensionful 
quantity to take its real world value. Usually a hadron mass is chosen, because these 
are easiest to determine on the lattice, but it should not be one whose mass depends 
strongly on valence quark masses to be determined in the next step (see below) otherwise 
a complicated iterative tuning procedure will result. The most popular quantity to use at 
present is known as rg, a parameter associated with the potential between two infinitely 
heavy quarks. It is extracted from the energy exponent of a gluonic operator (the closed 
loop of Figure 3), so can be precisely determined and does not contain any valence quark 
masses. The only problem is that it is not an experimentally accessible quantity, and 
we rely on potential model results to give a phenomenological value, estimated to be 
0.5fm. Another quantity frequently used is the mass of the p meson, obtained by chiral 
extrapolation to the point where the 7 meson mass, and therefore the u,d quark mass, 
is (almost) zero. The chiral extrapolation, however, can produce large errors. A better 
quantity is the orbital excitation energy, i.e the splitting between P states and S states, in 
bb or cé systems, since these don’t contain light quarks and this splitting is even insensitive 
to the heavy quark mass. (The treatment of heavy quarks on the lattice will be discussed 
in Section 4.) 

Step 8 

The step above yields all hadron masses in GeV. However, before we can compare to 
experiment we must tune the quark masses. This requires calculations at several different 
values of the bare quark masses in an appropriate region. For each quark mass we then 
select a hadron whose mass will be used for tuning (and is therefore not predicted). For 
that hadron we interpolate/extrapolate the results to find the bare quark mass at which 
that hadron mass is correct. The masses of other hadrons containing that quark are 
then predicted if we interpolate/extrapolate those masses to the same quark mass, or 
combination of quark masses. In the process we learn about the dependence of hadron 
masses on the quark mass and this can be useful theoretical information. The hadrons 
used for tuning should be low-lying states with accurate experimental masses which can 
be calculated precisely on the lattice. The m mass is usually used to fix the u, d mass 
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(taken to be the same), although sometimes the approximation m, = 0 is used. The mass 
of the K, K*, or ¢ can be used to fix the s quark mass. The K or K* obviously require 
the u and d masses to have been fixed. The dimensionless ratio of the K* to the K mass 
can also be used, and this is then less dependent on the quantity used to fix the lattice 
spacing. For the c(b) quarks, the D(B), D,(B,) or (Y) systems are convenient ones. 


The interpolation/extrapolation of hadron masses as a function of bare quark masses 
is a relatively simple procedure in the quenched approximation. Then there is no feedback 
from the quark sector into the gluon sector. We can create gluon field configurations at 
a fixed value of the lattice spacing (as determined, for example, from a purely gluonic 
quantity such as rg) and measure hadron masses at many different quark masses on those 
configurations. The issues are then the correlations between results at different quark 
masses that must be taken into account and the spurious non-analytic behaviour in quark 
mass that can arise in the quenched approximation in extrapolations to u and d masses 
(‘quenched chiral logarithms’). 


When we include dynamical quarks in the calculation, the effects of the quark deter- 
minant at a particular quark mass feed into the gluon field configurations. Results at 
different dynamical quark masses then represent a completely new calculation, generating 
a new ensemble of gluon configurations with statistically independent results. The inter- 
polations/extrapolations in quark mass take on a new dimension and there are subtleties 
associated with how to do this. Some groups have chosen to generate configurations at 
fixed bare coupling constant and various dynamical bare quark masses. Then the lattice 
spacing will vary with quark mass and extrapolations in quark mass must be done in 
lattice units, before fixing the lattice spacing at the end. I believe a more satisfactory ap- 
proach from a physical perspective is to adjust the bare coupling constant at different bare 
quark masses so that the lattice spacing remains approximately the same (as determined 
from ro, for example). This then allows interpolations/extrapolations for physical hadron 
masses, and a better picture of the physical dependence of quantities on the presence of 
dynamical quarks. Several groups have also carried out this procedure. 


In all of these approaches we must extrapolate to reach the physical u/d mass region, 
and so we need to know the appropriate functional form for this extrapolation. This 
can be derived for light enough u/d mass using an effective theory of Goldstone pions 
called chiral perturbation theory. This shows that logarithmic behaviour of quantities as 
a function of the r mass (the variable representing the u/d quark mass) should be present 
in general as well as simple power-law behaviour. These ‘chiral logarithms’ will only show 
up at rather small quark masses (mua X m,/4) and so it is important for dynamical 
simulations to reach quark masses low enough to be able to match on to this behaviour 
and extrapolate down. 


Step 9 


The calculation needs to be repeated at several values of the lattice spacing to check that 
the dependence of physical results on the lattice spacing is at an acceptable level and/or 
to extrapolate to the continuum limit a = 0. Extrapolations again obviously require 
knowledge of an appropriate functional form. 


Step 10 


Compare to experiment or give a prediction for experiment! 
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Concluding remark 


Above we have described an ideal situation. Lack of computer power has meant compro- 
mising on one or more aspects in existing calculations. A lot of calculations have used the 
quenched approximation. More recent dynamical calculations have used heavy dynamical 
masses on rather coarse and sometimes rather small lattices. These difficulties should 
be overcome in the next few years and this will represent a huge improvement in the 
reliability of lattice results. 


3.2 Control of lattice systematic errors 


We aim for errors of a few percent from future lattice calculations. This requires both 
improved statistical errors in general and good control of systematic errors. Improved sta- 
tistical accuracy is obtained by generating larger ensembles of configurations with a cost 
proportional to the square of the improvement. Improved systematic accuracy requires 
theoretical understanding of the sources of error and how to remove them. It is this under- 
standing, described below, that has been responsible for the development of good lattice 
techniques and the convergence of lattice results in the quenched approximation through 
the late 1990s. This must be carried further in the next phase of dynamical simulations 
to reach the goal of providing quantitative tests of QCD and input to experiment. 


3.2.1 Discretisation errors 


As discussed earlier, these arise from errors in the lattice form of the Lagrangian, and 
operators ©, compared to the continuum versions. Lattice results, even when converted 
to physical units, have some dependence on a. This will be as a power series in a, starting 
at a”. As discussed earlier, n = 1 if the Wilson quark action is used, 2 for the clover 
quark action and 2 for the staggered quark action. n is also 2 for the Wilson plaquette 
gluon action of Equation 9. We expect the size of the a dependence to be controlled by a 
typical momentum scale relevant to the quantity being calculated. Quantities sensitive to 
shorter distances than others will be more susceptible to discretisation errors, even though 
the value of n depends only on the action used. Improved gluonic and quark actions are 
available in which higher order terms are added to £ to increase n, and therefore reduce the 
a dependence, and these can be tested for their efficacy in the quenched approximation. 
The systematic improvement method is known as Symanzik improvement (Gupta, 1998). 


Figure 11 shows a scaling plot of the vector meson mass (the p, except that the quark 
mass is heavier than the real u, d mass) in GeV versus the lattice spacing for various quark 
actions (Toussaint, 2002). Some of the calculations use an improved gluon action, with 
discretisation errors reduced beyond O(a?), but others use the Wilson plaquette action. 
There is very little difference between these (compare fancy diamonds and squares) so 
that most of the difference arises from the quark action used. A variant of ro, called 
Tı, is used to set the lattice spacing so the vector mass and scale are given in units of 
ri. The plot shows results for clover quarks (improved Wilson quarks), staggered quarks, 
improved staggered quarks and Ginsparg-Wilson (domain wall) quarks. The last two 
formulations, which are both improved to remove O(a?) errors show an impressively flat 
line, i.e. very little a dependence for this quantity. The clover quarks shown here have 
a clover improvement coefficient (see Section 2.4.1) chosen using tadpole-improvement. 
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Figure 11. A scaling plot in the quenched approzimation for the vector meson at a 
quark mass such that the pseudoscalar meson has mass m,r, = 0.807. The vector meson 
mass is given in units of rı, a variant of ro, where rj % 0.85fm = 0.57GeV-!. It is 
plotted versus the square of the lattice spacing, also given in units of rı. The squares and 
fancy diamonds use an improved gluon action; the others use the Wilson plaquette action. 
The quark actions used are: circles, staggered (Kogut-Susskind); squares, improved stag- 
gered; fancy squares and fancy diamonds, tadpole-improved Wilson (clover); diamonds, 
Ginsparg- Wilson (domain wall). (Toussaint, 2002) 


This reduces the a dependence of Wilson quarks to o,a but it is clearly still visible. A 
non-perturbative determination of the clover improvement coefficient can reduce the a 
dependence further to O(a?), and then this formulation looks rather better. Notice the 
large discretisation errors visible for unimproved staggered quarks, despite the fact that 
the errors are O(a?) (and results therefore lie on a straight line in the Figure). Provided 
that all the different quark formulations have been fixed to the same physical quark mass, 
all the results for the vector meson mass should agree in the a — 0 limit. This does seem 
to be true, within the statistical errors shown. 


3.2.2 Finite volume 


Lattice results will be distorted if the space-time box in which the calculation is done is 
too small to adequately represent the infinite space-time volume of the real world. For 
large enough volumes the error should be exponential in the lattice size, x e^ M^, for a 
lattice of size L in physical units. This means that it is possible to reduce finite volume 
errors rapidly to zero by taking large enough volumes. The lightest particle is the 7, so 
this sets the volume required as we reduce the u, d quark masses to their physical values. 
For u, d quark masses of m,/4, m4 L > 5 for L > 3fm, giving a finite volume error of less 
than 196. Most recent lattice calculations have used volumes of this size, although there 
has been little systematic dependence of the volume dependence of results. 
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3.2.3 Matching hadronic matrix elements to the continuum 


The calculation of hadronic matrix elements of various currents, J, on the lattice is dis- 
cussed for 2- and 3-point functions in Section 2.6. An important point is that these depend 
in general on how QCD has been regularised and a finite renormalisation is then required 
to convert lattice results to those appropriate to a continuum scheme (such as MS). 
Since lattice QCD and continuum QCD differ in the ultra-violet (for momenta greater 
than 7/a), this renormalisation can be calculated in perturbation theory, by matching 
the matrix elements of J between quark states. We usually need several lattice currents 
to make up the continuum current and a mixing and matching calculation must be done. 


Jcont = Zod + aZ Jp te 
Zi 14+ cl a,(2/a) + cO)? (2/a) T (33) 


Lattice perturbation theory is done in the same way as continuum perturbation the- 
ory, in terms of the field A, and including gauge-fixing and ghost terms, if necessary. 
Relatively little lattice perturbation theory has been done up to now and few results exist 
beyond O(a,). This leaves errors of O(o2), 5-10% if we take a scale for a, of 2/a at 
a=0.1fm. Higher order calculations will be required to reduce this to the required level 
of 2-396, and techniques are being developed to do this. It is also sometimes possible to 
fix the normalisation of lattice currents non-perturbatively using symmetry arguments or 
to match numerically between lattice and continuum MOM-type schemes. In whatever 
way it is done, the matching of lattice matrix elements to the continuum is a lot of work 
and an area where improvements are still necessary. 


3.2.4 Unquenching 


The neglect of dynamical quarks in the quenched approximation is obviously wrong, but 
how wrong? For many years systematic errors from the quenched approximation were 
obscured by the size of the statistical and discretisation errors. Now improved quenched 
calculations are showing internal inconsistencies and disagreement with experiment which 
we believe will be removed once realistic dynamical calculations can be done. 


One effect expected in the quenched approximation is the incorrect (too fast) running 
of the coupling constant from one scale to another because of the absence of g > qq > g 
pieces in the vacuum polarisation to give quark screening of the color charge. From this we 
might expect that the determination of the lattice spacing would depend on the quantity 
used to fix it, since different quantities will be sensitive to different distance/momentum 
scales and these will not be connected correctly by the running of a, in the quenched 
approximation. (Using a quantity to fix a is equivalent to fixing the QCD coupling 
constant at the momentum scale relevant to that quantity). This is indeed found aud 
illustrated by the quenched point in Figure 12. Likewise hadron masses depend on the 
hadron used to fix the quark mass. Then if a set of hadron masses is studied, sensitive 
to a range of scales and containing different combinations of quarks, errors will show up 
(see Figure 13 (Aoki, 2000, CP-PACS collaboration)). 


The quenched approximation also does not allow the decay of particles where this 
requires the production of a jq pair from the vacuum, e.g p > 77. Once dynamical 
quarks are light enough for this to happen, it will in fact be difficult to determine m, 
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Figure 12. The ratio of inverse lattice spacings, a^! , obtained from the orbital excitation 
energy, the splitting between 1P and 1S states, in the T system and from ro. Results 
are given for quenched simulations and for dynamical simulations using two flavors of 
dynamical quarks at three different values of the quark mass, all heavier than m,, indicated 
by the square of the corresponding pion mass along the x axis. (Marcantonio, 2001, 
UKQCD collaboration) 
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Figure 13. The spectrum of light mesons and baryons obtained in the quenched approzi- 
mation after extrapolation to u,d quark masses and to the continuum limit. The p and v 
masses are missing since they were used to fix the lattice spacing and u,d masses. Results 
are compared using the K or the @ to fiz the strange quark mass and disagreement between 
the two is seen. The size of the discrepancy with experiment depends on this and varies 
between hadrons, but is at the level of 10%. (Aoki, 2000, CP-PACS collaboration) 
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Figure 14. The computer cost in teraflop-years of generating 500 30? x 60 configurations 
with a = 0.1fm and a dynamical quark mass which gives the ratio of pseudoscalar to 
vector meson masses along the x axis. Clover and improved staggered quarks are compared, 
assuming the same scaling behaviour as (m, /m,)?. The straight line shows what is possible 
in 6 months on a 5Tflops computer. 


since we will obtain instead the lighter mass of the two-pion system. It is then important 
in dynamical simulations to use hadrons which are stable in QCD, or have very narrow 
widths, to fix the quark masses in the QCD action. 


It has been stressed that the numerical cost of unquenched calculations is very high. 
It increases very rapidly as a is reduced at fixed physical volume and as m,,4 is reduced, 
although the exact scaling behaviour is not completely clear. Figure 14 estimates the cost 
of generating an ensemble of 500 gluon configurations on an L? x T lattice with L = 3fm 
and T = 2L at a lattice spacing, a = 0.1fm, as a function of the u, d dynamical quark 
mass. The z axis is plotted as the ratio m,/m, where the 7 and p are the pseudoscalar 
and vector mesons made with valence quarks of the same mass as the dynamical quarks. 
The real world has m,/m, = 0.2. For mua = m, the ratio is 0.7, for mua = m,/2, 0.55 
and for m,/4, 0.4. For m,/2 the ratio is obtained from the K and K* masses. For m, 
and m,/4 some arguments must be made about the scaling of hadron masses with quark 
masses because, for example, no pure ss pseudoscalar meson exists. The cost varies here as 
(m,/m,)*, which is based on estimates from simulations. Figure 14 compares the cost for 
clover quarks and improved staggered quarks, again based on simulations at one quark 
mass, and using the same scaling formula. The cost advantage of improved staggered 
quarks is clear on this plot. One disadvantage is that the algorithm generally used for 
two flavors of dynamical staggered quarks is not exact, unlike that for clover. This means 
that there are systematic errors, rather like discretisation errors, which increase with the 
computer time step, e, which is used to generate one gluon configuration from the previous 
one. Checks must to be done to make sure that this systematic error is at an acceptable 
level and/or an extrapolation to « = 0 must be done. 


Recent unquenched calculations, albeit with rather heavy dynamical quark masses, 
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Figure 15. The masses of the ọ and K* mesons as a function of lattice spacing, a, for 
quenched simulations and those using two flavors of dynamical quarks. The K meson is 
used to fiz the strange quark mass. The esperimental results are indicated by diamonds 
ata = 0. The Iwasaki improved gluon action was used with a clover quark action. (Ali 
Khan, 2002, CP-PACS collaboration) 


have shown encouraging signs that systematic errors from the quenched approximation are 
being overcome. Figure 12 shows that the ratio of a^! values obtained from two different 
quantities is closer to 1 on dynamical configurations (using two flavors of dynamical quarks 
with a mass around m,) than it was on quenched configurations (Marcantonio, 2001, 
UKQCD collaboration). From this we can hope that with 2 dynamical light quarks and a 
dynamical strange quark there will be only one value of the lattice spacing, corresponding 
to the one dimensionful scale of QCD in the continuum. 


Figure 15 compares results for the masses of ¢ and K* mesons on quenched and 
unquenched configurations as a function of a. The K meson is used to fix m, and gives 
poor results for the K* and the $ in the quenched approximation, as described earlier. 
For two flavors of dynamical quarks, the K* and $ masses are much closer to experiment, 
at least after a continuum extrapolation (Ali Khan, 2002, CP-PACS collaboration). One 
worrying feature of this plot is the size of discretisation errors in the unquenched case, 
implying that the improved action used does not work very well unquenched. 


Figure 16 shows another quantity from light hadron physics that gives a problem in 
the quenched approximation. This is the difference of the squared vector and pseudoscalar 
masses for given quark combinations. Experimentally the result is very flat as a function 
of quark mass, being + 0.55GeV? from the 7, p to the D, D*. In the quenched approx- 
imation this quantity has a pronounced downward slope as the quark mass is increased. 
Recent results from the MILC collaboration with 2 (m,/4) + 1 (ms) flavors of dynamical 
improved staggered quarks show qualitatively different behaviour, much closer to that 
of experiment (Bernard, 2001, MILC collaboration). This is the strongest indication yet 
that calculations with dynamical quarks will overcome the disagreements between the 
quenched approximation and experiment. 
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Figure 16. The difference of the squares of the vector and pseudoscalar masses for 
various light hadrons, obtained with quenched and dynamical lattice QCD. The dynamical 
results have 2+1 flavors of dynamical quarks with masses zz m,/4 and m,. The dynamical 
results are given only for valence quark masses equal to the dynamical ones. Experimental 
results are given by the bursts, using an estimated mass for the pseudoscalar s8 meson. 
The lattice spacing has been obtained using ro = 0.5fm. Errors do not include errors from 
fixing the lattice spacing (Bernard, 2001, MILC collaboration). 


4 Lattice QCD results 


The Proceedings of each year's lattice conference provide a useful summary of current 
results and world averages. See (LAT2000, LAT2001). Almost all lattice papers can 
be found on the hep-lat archive, http://arXiv.org/hep-lat/. I have deliberately chosen 
to refer to reviews where possible and these should be consulted for fuller access to the 
literature. 


4.1 Methods for heavy quarks 


Bottom and charm quarks are known as heavy quarks since they have masses much greater 
than the typical QCD scale, Agcp, of a few hundred MeV. Top quarks are also heavy, 
of course, but do not have interesting bound states so are not studied by lattice QCD. 
b and c quarks could be treated in the same way as u, d, or s quarks on the lattice except 
that, with current lattice spacings of about 0.1fm, we have m,a in the interval 2-3 and 
Mea in 0.5-1. If ma is not small then discretisation errors of the form ma, (ma)? etc. 
will not be small either and such an approach will not give accurate results. Relativistic 
momenta, p z m, can also not be well simulated if pa is not small: pa of O(1) corresponds 
to wavelengths which are in danger of being small enough to 'fall through' the holes in 
the lattices. 


To reach the very fine lattices that would be required to give mpa « 1 and accurate 
simulations for 6 quarks would require an amount of computing power way bevond our 
current hardware even in the quenched approximation. Luckily the physics of heavy 
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quark systems in the real world means that we do not have to do this; indeed, it would 
be largely a waste of computer power. b and c quarks are non-relativistic in their bound 
states, so that m and p % m are irrelevant dynamical scales. The non-relativistic nature 
is evident from the hadron spectrum. "There are heavy-heavy bound states in which 
both the valence quark and anti-quark are heavy (Y, y and Be) and heavy-light bound 
states in which the heavy (anti-)quark is bound to a light partner (B, B,, D, D,) or 
partners, in the case of baryons (Ay, Ae). In all cases the mass difference (splitting) 
between excitations of these quark systems is much less than the mass of the hadrons. 
For example m(Y')—m(Y) = 560MeV, m(Y) = 9.46GeV. The internal dynamics, which 
controls these splittings, operates with scales much smaller than the quark mass. Instead 
the important scales are the typical momentum carried by the quark inside the bound 
state, mv, and the typical kinetic energy, imv. 'That these scales are small compared 
to m implies that v/c « 1. The use of non-relativistic techniques on the lattice is then 
a good match to the physics of b and c systems as well as providing an efficient way to 
handle them numerically on the lattice. 


There are several ways to proceed, and it is important when reading the lattice lit- 
erature to understand which method has been used. In the remainder of this section we 
consider three methods in particular: (a) static quarks, (b) NRQCD (a non-relativistic 
version of QCD) and (c) heavy relativistic quarks. 


Static quarks 


'This is the m — oo limit of heavy quarks. In this limit Heavy Quark Symmetry holds and 
quarks become static sources of colour charge with no spin or flavor. This is evident on 
the lattice as the quark propagator becomes simply a string of gluon fields along the time 
direction (Eichten, 1990). Obviously no real quarks have infinite mass but this is a useful 
limit for studying heavy-light systems. Corrections away from the infinite mass limit are 
the subject of Heavy Quark Effective Theory (Buchalla, 2002). 


NRQCD 


NRQCD is a non-relativistic version of QCD (Lepage, 1992). The Lagrangian for heavy 
quarks is the non-relativistic expansion of the Dirac Lagrangian: 


D? a-B 
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(34) 


where additional terms can be added to go to higher order in u/c. 1? is now a 2-component 
spinor since the quark and anti-quark fields of the Dirac fields decouple from each other. 
D is a covariant derivative, including coupling to the gluon field. B is the chromomagnetic 
field, related to space-space components of the field strength tensor, B; = ei Fj”. mg is 
the quark mass; heavy quarks are frequently generically denoted Q in contrast to the q 
used for light quarks. Notice that the quark mass term wmgaw has been dropped. This 
simply redefines the zero of energy so that the energies of all hadrons determined on the 
lattice are less than 1. 


The NRQCD Lagrangian can be discretised onto a lattice and leads to much simpler 
and faster numerical algorithms for calculating the quark propagator than for light quarks. 
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Instead of having to explicitly invert a matrix using an expensive iterative procedure such 
as Conjugate Gradient, the propagator is simply calculated by stepping through the lattice 
in time and calculating the propagator at time t from that at time £ — 1. This is simply 
illustrated if we look at the Lagrangian in the infinite mass limit, where it becomes the 
Lagrangian for static quarks. Only the first term above contributes and we have: 


Sg = V(x) (U(z)u(z + 1) - ¥(2)) (35) 


M is then an upper triangular matrix, using the notation of Equation 27, and the quark 
propagator is given by: 
(Mois) = Ul (Mo). (36) 


The general start and end points, x and y, are simply denoted here by their ¢ co-ordinates, 
0 for the origin and £ for the end point. To move from end point £ to ¢ + 1 just requires 
multiplication by the appropriate U field in the time direction, so M~' does not change 
spatially and becomes a string of U fields as described for static quarks above. For 
NRQCD with non-infinite masses, the evolution equation in t for the propagator is not as 
simple and does contain spatial variations (e.g. from the spatial covariant derivatives in 
Equation 34) but the same principles apply. A smearing function, ¢, is chosen at the time 
origin and then the propagator calculated from 0 to later times by an evolution equation 
from one £ to the next. This makes NRQCD numerically very attractive. Heavy quark 
propagators, once calculated, can be combined together or with a light quark propagator 
to make 2- and 3-point functions for heavy hadrons as described for light hadrons earlier. 
As described there also, the value for the bare heavy quark mass in lattice units, mga, is 
adjusted, given a value for a, until a heavy hadron mass is correct in GeV. The energies 
of heavy hadrons calculated on the lattice do not in fact equate directly to their masses 
because the mass term was removed from the Lagrangian. Instead, for one heavy hadron 
we have to calculate an energy-momentum dispersion relation and derive its mass from 
the momentum dependence (E x p?/2M). 


NRQCD is an effective theory, containing the right physics for low momentum heavy 
quarks. Adding more relativistic corrections to the Lagrangian can make this more ac- 
curate. These higher order terms appear with coefficients (such as c in equation 34) 
which must be determined by matching to relativistic QCD. These coefficients represent 
the effect of relativistic momenta missing from NRQCD and they are governed by a, at 
this high momentum scale and so are perturbative. High momenta for both quarks and 
gluons are missing anyway on the lattice because of the discretisation of space-time. We 
described earlier how a better match between lattice QCD and QCD is made by adding 
terms to the lattice QCD Lagrangian which are higher order in a, with a coefficient which 
depends on the strong coupling constant at the lattice cut-off scale. That the two pro- 
cedures are very similar is not an accident; indeed, the same higher dimension operators 
appear in both cases. In this case NRQCD is simply making a virtue of the existence 
of the lattice cut-off. The difference is, however, that in the NRQCD case the operators 
appear with inverse powers of mga (in a dimensionless lattice notation) and so mga, and 
therefore a, cannot be taken to zero in this approach. NRQCD has no continuum limit, 
but this does not prevent physical results being obtained at finite lattice spacing. It is 
just necessary to show that the results are sufficiently independent of a over a range of 
values of a. 
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Heavy relativistic quarks 


This method looks very different from NRQCD, but has a lot of features in common. 
The use of a relativistic action, such as the Wilson/clover action, for heavy quarks on a 
lattice does not have to be incorrect if the results are interpreted carefully (El-Khadra, 
1997). The main point to realise is that the existence of a large value for ma breaks the 
symmetry between space and time. The inverse quark propagator in momentum space has 
an energy at zero momentum very different from its mass (e.g. for a free Wilson quark, 
E(p = 0) = In(1+ ma)) but its momentum dependence for small momenta is correct 
(i.e. as p?/ma). Thus, we can ignore the ma errors in the energy if we fix masses from 
the energy-momentum relation as for NRQCD. For more precision we must add higher 
order discretisation/relativistic corrections. These will appear with coefficients chosen to 
match continuum relativistic QCD. As we have seen the coefficients are a power series in 
a; at the cut-off scale and they will depend on ma. For small ma the coefficients will be 
those of a discretisation correction to the action; for large ma they will go over to the 
NRQCD coefficients. For example, the c,,F,,, clover term corrects for an O(a) error in 
the Wilson action for light quarks; for heavy quarks, it becomes the relativistic correction 
which couples the quark spin and the chromomagnetic field. In this way an action can 
be developed that smoothly interpolates between heavy and light quark physics, at the 
numerical cost of having to handle heavy quarks in the same way as light ones. This 
method is sometimes known as the Fermilab method, since it was pioneered there. 


The charm quark mass is not very heavy on the finest of current quenched lattices, 
and some groups have taken the standard relativistic approach in this case. To reach the 
b quarks then requires an extrapolation jointly in the heavy quark mass and the lattice 
spacing (Maynard, 2002, UKQCD collaboration) to avoid confusing discretisation and 
relativistic corrections. Such an extrapolation inevitably has rather large errors. A better 
approach is to consider a formalism which explicitly breaks space-time symmetry in order 
to restore the relativistic energy-momentum relation for heavy quarks. For example, you 
can take an anisotropic lattice which has a much finer spacing in the time direction than 
in the space directions. ma, is then small and the heavy quark looks like a light one, 
at the cost of having many more timeslices on the lattice, and having to determine the 
lattice spacing in both directions (Chen, 2001). 


4.2 The heavy hadron spectrum 


The spectrum of heavy-heavy states has largely been the province of NRQCD (Davies, 
1998). Figure 17(a) shows the radial and orbital excitations of the bb Y system, obtained 
both on quenched gluon configurations and those with two flavors of dynamical quarks 
(Marcantonio, 2001, UKQCD collaboration). For these results the lattice spacing has 
been fixed by demanding that the splitting between the Y(15) and the spin-average of 
the P-wave (X,) states is correct. The b quark mass has been fixed by requiring that 
the Y(1S) mass be correct. It is only the 2S (Y"), 39 (Y") and 2P (X}) states that are 
predicted from this calculation, and they have rather large statistical errors at present. 
It is a general feature of lattice calculations that ground state masses are more precise 
than excited state masses. For both excited and ground states the noise is controlled by 
the ground state mass. For excited states the signal/noise ratio is then much worse and. 
becomes exponentially bad at large T. 
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Figure 17. (a) The radial and orbital excitations in the bb system, as calculated in lattice 
QCD using NRQCD for the b quarks (Marcantonio, 2001, UKQCD collaboration). (b) The 
fine structure of low-lying bb states. Legend: the horizontal dashed lines are experimental 
values; open circles show the quenched approzimation; solid circles correspond to 2 flavors 
of clover dynamcal quarks with mass m,. (The lowest cluster of points on the right show 
an extrapolation to lighter dynamical mass and to N;s=3. 


Of more immediate interest is the fine structure of the low-lying S and P states, shown 
in Figure 17(b). These can be determined very precisely on the lattice, particularly the 
‘hyperfine’ splitting between the spin-parallel vector Y state and the not-yet-seen spin- 
antiparallel pseudoscalar ry. A comparison with experiment, when it exists, for this 
splitting will provide a very good test of lattice QCD and our 5 quark action, which will 
be important for the lattice predictions of B matrix elements described in Section 4.3. 


The accuracy of the NRQCD, or other lattice action, for heavy-heavy bound states can 
be estimated by working out what order in an expansion in powers of u/c is represented 
by each term. e.g. the first two terms in the NRQCD action of Equation 34, i.e. the 
time derivative and the kinetic energy term, are both O(v?/c?). This is because the 
‘potential energy’ and kinetic energy terms are roughly equal for two heavy particles. 
These terms give rise to the radial and orbital splittings, and the ratio of these (= 500MeV) 
to half the T mass gives an estimate of v?/c? ~ 0.1 for b quarks in an Y. Higher 
relativistic corrections, such as the D*/8mj, term, are O(v* /c*) and should give roughly 
a 1096 correction to these splittings. These terms were included here, but not the v?/c$ 
corrections, so an error of roughly 1% remains. The ø +B term of Equation 34 is the first 
spin-dependent term and is O(v*/c*). It gives rise to the hyperfine splitting and a similar 
term of the same order, proportional to ø - D x E, gives rise to the P fine structure. 
The fine structure is indeed roughly 1096 of the radial and orbital splittings. Including 
only these terms in the NRQCD action, as was done here, implies an error of roughly 
1096 in these splittings. A more precise calculation, necessary to test this action against 
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Figure 18. The spectrum of cc states, as calculated in lattice QCD using anisotropic 
quenched configurations (Chen, 2001). 


experiment, will require the v9/c9 spin-dependent terms and the o,v*/c* terms implied 
by calculating the coefficient c in front of the ø -B term in equation 34. This is now being 
done. Figure 17(b) does show, however, that the hyperfine splitting increases when two 
flavors of dynamical quarks are included, and continues to increase as the dynamical quark 
mass is reduced towards real u and d quark masses. We expect the Y to see also s quarks 
in the vacuum and extrapolating the number of dynamical flavors to three increases the 
splitting further. 


The charmonium, 4%, system is more relativistic that the Y system and correspondingly 
less well-suited to NRQCD. Estimates as above give v?/c? z 0.3. Figure 18 shows the 
charmonium spectrum obtained from anisotropic relativistic clover quarks in the quenched 
approximation (Chen, 2001). The lattice spacing and charm quark mass were fixed in the 
analogous way to that described above, except that the spin average of the vector Jy and 
pseudoscalar 7, masses was used to fix me. Since the n, mass is known experimentally this 
gives improved precision since the spin-average is not sensitive to any inaccuracies in spin- 
dependent terms. The spectrum given in Figure 18 includes some gluonic excitations of 
the cc system, i.e. ccg states, called hybrids. Their existence is expected simply from the 
non-Abelian nature of QCD which allows gluons themselves to carry color charge. Some 
of these hadrons have exotic quantum numbers not available to mesons made purely of 
valence quarks, and the prediction of their masses will be important for their experimental 
discovery. 


Figure 19 shows the spectrum of mesons made from one 6 quark and one light (u/d 
or s) anti-quark in the quenched approximation (Hein, 2000). NRQCD was used for the 
b quark, and the clover action for the light quark. In this case the lattice spacing was 
fixed using a quantity from the light hadron spectrum, m,, because heavy-light systems 
are more similar in terms of internal momentum scales to light hadrons than heavy- 
heavy ones. See the comments in Section 3.2 on how the lattice spacing in the quenched 
approximation depends on the quantity used to fix it. The u/d and s quark masses were 
fixed using the * and K masses. The b quark mass was fixed from the spin-average of 
the B and B* meson masses. Taking a spin-average, as above for charmonium, avoids 
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Figure 19. The spectrum of bound states of ab quark with a light anti-quark as calculated 
in lattice QCD in the quenched approzimation using NRQCD for the b quark (Hein, 2000). 


any errors from spin-dependent terms in the action. The 6 quark mass obtained this 
way differs from that obtained above from the Y system, and is another feature of the 
quenched approximation. In the ‘real world’ there is only one lattice spacing and one set. 
of quark masses and parameters fixed from the Y system will be used to predict the entire 
B spectrum. 


The power counting in v/c for terms in the Lagrangian works rather differently in 
heavy-light systems compared to heavy-heavy ones. Now there is one quark that carries 
almost all the mass of the heavy-light system and it sits in the centre surrounded by the 
swirling light quark cloud. This picture makes sense even in the limit in which the heavy 
quark has infinite mass when the Lagrangian would contain only the covariant temporal 
derivative D, (static quarks). The higher order terms in the Lagrangian can then be 
ordered in terms of the inverse powers of the heavy quark mass that they contain. This is 
equivalent to an expansion in powers of v/c. The typical momentum of a heavy quark in 
a heavy-light system is O(Agcp) (as is that of the light quark) and so v/c = Agcp/mo. 
This gives v/c ~ 10% for the B and 30% for the D. 


Again the power counting exercise enables us to understand the approximate relative 
sizes of different mass splittings in the spectrum and the accuracy of our lattice QCD 
calculation to a given order in v/c. The leading spin-independent term in the action is D, 
giving rise to the orbital and radial excitations of ~ 500MeV. The kinetic energy term, 
D?/2mg gives a Aqcp/mq correction to this, which depends on the quark mass and, 
therefore flavor. This explains why these excitation energies are so similar for B and D 
systems; the similarity between 7 and Y is more accidental. The leading spin-dependent. 
term is a-B/2mq, which gives rise to fine structure such as the splitting between the 
pseudoscalar B and vector B*. This splitting should then be smaller by a factor of 
Aqcp/mq compared to the spin-independent splittings and this is indeed observed. To 
calculate this splitting precisely on the lattice requires the inclusion of higher order terms 
in the Lagrangian, as well as a better matched coefficient c for the ø - B term and this 
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Figure 20. A comparison of determinations of the strong coupling constant, expressed 
as «M *(Mz) (PDG, 2001). 


will be done in future calculations. 


We have stressed that lattice QCD is simply a way of handling QCD. It has the same a 
priori unknown parameters as QCD, the overall scale (equivalent to the coupling constant) 
and the quark masses. These parameters come from a deeper theory and must simply be 
fixed in the QCD Lagrangian using experiment and the results from a calculation in QCD. 
Ás described in Section 3, Lattice QCD provides the most direct way of doing this. The 
values for the parameters obtained are then useful input to other theoretical techniques. 

Determination of the lattice spacing at a given lattice bare coupling constant, is equiva- 
lent to (and can be converted into) a determination of the renormalised coupling constant, 
o4, at a physical scale in GeV. To compare to other determinations of a,, this can be 
converted to the MS scheme and run to Mz. Figure 20 shows a comparison of different 
determinations of o, from the Particle Data Group (PDG, 2001). It is clear that the 
lattice result is one of the most precise. 


All methods for determining o, have three components: 


1. Theoretical input: a perturbative expansion in o5, for some quantity. 
2. A value for that quantity. 


3. An energy scale. 


Most methods use an experimental result for stage 2, where the lattice uses a non- 
perturbative evaluation on the lattice of the vacuum expectation value of a simple short- 
distance gluonic operator (such as the plaquette). This avoids the problems of hadronisa- 
tion etc which reduce the precision of methods based on the experimental determination 
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of jet shapes or cross-sections. All methods use experiment for stage 3, and here the 
lattice-based determination needs an experimental result to fix the lattice spacing. A 
good quantity to use here is the orbital excitation energy (1P—15) in, say, the Y system 
since this is well-determined on the lattice and directly measured experimentally. 


Quark masses are also well determined on the lattice. Since quarks are not freely 
available to be weighed, as an electron would be, care must be taken in defining what 
exactly is meant by the quark mass. The bare mass in the lattice QCD Lagrangian for a 
particular action, determined by the requirement to get a particular hadron mass correct 
and converted to physical units, is a well-defined quantity but not very convenient. We 
can convert it perturbatively into, say, the running quark mass in the MS scheme. The 
best current determination of the 6 quark mass is in fact from the static approximation 
in which 6 quarks have infinite mass. There is no bare 6 quark mass in that case; instead 
the binding energy mg — m, is calculated, and from that, m, is determined. The binding 
energy is small compared to mpg and has only weak dependence on the b quark mass, so 
for this quantity the static approximation is a good one. The b quark mass obtained in 
this way is 4.30(10)GeV in the quenched approximation, with some indications that it is 
slightly lighter when dynamical quarks are included (Lubicz, 2001). 


4.3 Heavy hadron matrix elements 


Precise lattice calculations of matrix elements for B decay are essential to the experimental 
B factory programme (Stone, 2002). This aims to test the internal consistency of the 
Standard Model in which CP violation occurs through the Cabibbi-Kobayashi- Maskawa 
matrix. The weak decays of the b quark are particularly useful in giving us access to poorly 
known elements of this matrix. The unitarity of the CKM matrix can be represented by 
a triangle; the position of the upper vertex being constrained by the determination of 
angles and sides, see Figure 21. The angles are determined directly by measurement of 
asymmetries. The determination of the sides requires both the experimental measurement 


Figure 21. The unitarity triangle with constraints on the upper vertez obtained from 
different quantities (Hocker, 2001). The lower vertices are at 7 = 0, p = 0 and 1. 
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of a decay rate and its theoretical calculation.This allows the magnitude of one of the 
CKM elements to be extracted. Below we describe the lattice calculation of the matrix 
elements most important for this programme. The extent to which the unitarity triangle 
can be tested depends on both the experimental and the theoretical errors. It is critical to 
reduce the errors from lattice calculations to a few percent, otherwise they will dominate 
the uncertainties from experiment. 


The simplest 2-point matrix element that can be calculated on the lattice is that for 
the decay constant of the charged pseudoscalar heavy-light mesons (see Figure 8). For 
the B this is known as fg and it is obtained from the vacuum to B matrix element of the 
axial vector current which couples to the W. 


(014,]B) = pa fa. (37) 


The purely leptonic decay rate of the B meson is then proportional to f2 times kinematic 
factors times the square of the CKM element which multiplies the appropriate axial vector 
current in the Lagrangian, in this case 1,50 (Rosner, 2002). In principle an experimental 
determination of the leptonic decay rate could be combined with the lattice calculation 
to yield Va», but in practice the experiment is very hard to do because the rate is so low. 
For other heavy-light mesons, it may be possible. fp, has been measured experimentally, 
but not very precisely as yet. It can be used, with lattice calculations, to give Vas. 


It is important to realise that, although we are discussing the weak decay of a b or c 
quark, the calculations are done in lattice QCD. The quark cannot decay in isolation, but 
must be bound into a hadron by the confinement property of QCD. The determination 
of the decay matrix element must then take into account all the QCD interactions inside 
the hadron (see Figure 8) and this requires lattice QCD. We do not put the W boson on 
the lattice. As far as QCD is concerned the B meson annihilates into the vacuum. The 
virtual W boson decay to leptons is put in by hand when we calculate the decay rate. 


Lattice calculations of fg improved markedly through the 1990s (this has been true of 
most lattice calculations) as we got to grips with the systematic errors. Figure 22 shows a 
timeline of results in the quenched approximation. It shows both that lattice calculations 
have markedly improved and that early calculations had very unreliable estimates of their 
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Figure 22. A timeline of results for the B meson decay constant, fg, calculated in lattice 
QCD in the quenched approximation. 
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errors. The most recent and best calculations do a careful job of matching the lattice 
representation of the axial vector current to the continuum. For heavy-light mesons 
we have to be careful both about relativistic (Aqgcp/mg) corrections and discretisation 
corrections to the leading order lattice current. Since mga is a dimensionless number, 
these two corrections in fact appear together and can be considered simultaneously. The 
matching between lattice and continuum is currently done only to O(a.) and this is the 
major source of error in the quenched approximation. Table 1 shows a typical ‘error 
budget’ for such a calculation. We need a more precise matching, either to o? or non- 
perturbatively (both of which can be done with a lot of hard work), to improve the errors 


beyond the 1096 level. 


Source percent error 


statistical + fitting 3 
discretisation O((aA)?) 4 
perturbative O(o2, o2/(aM)) 7 
NRQCD O((A/M)?, as A/M) 2 
light quark mass 4 
a !(m,) 4 


Total 10 


i 


Table 1. Source of error in a typical lattice calculation of fg using NRQCD for the heavy 
quark in the quenched approximation. a zz 0.1fm, M is the b quark mass, A a typical QCD 
scale of a few hundred MeV and a, is evaluated at 2/a. 


Recent reviews of lattice results (Ryan, 2002), (Bernard, 2001) have given the following 
*world averages' for lattice results in the quenched approximation: 


. fe = 173 + 23MeV 
. fo, = 230 + 14MeV 
. fn./ fn = 1.15(3); fo./fo = 1.12(2). 


(Note that the B, does not decay purely leptonically but the calculation of the appropri- 
ate matrix element can still be done in lattice QCD and yields useful information on its 
dependence on the light quark mass.) Large-scale calculations on dynamical configura- 
tions are only just beginning, so unquenched results are still unclear. It seems likelv that 
decay constants will be 10-2096 larger unquenched. 


A more important quantity from the point of view of the B factory programme is the 
mixing amplitude for neutral B mesons, B? and B,. This mixing gives rise to a difference 
in mass between the CP-eigenstates, Am, which can be measured experimentally through 
oscillations between particle and anti-particle (Stone, 2002). The mixing amplitude is 
given by the ‘box diagram’ (see Figure 23) in which the 6 quark and light anti-quark 
convert to a b anti-quark and light quark through the mediation of virtual Ws and (pref- 
erentially) ¢ quarks. The mixing amplitude is then proportional to the matrix element 
of the box between, say, a B? and a p multiplied by the product of CKM elements 
Vg Va. The current determination of |Vj4|? from experiment and theory gives a curve 
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Figure 23. The B bor diagram, related to that of a 4-quark operator. 


on the unitarity triangle plot (marked Amga on Figure 21). Future experiments will be 
able to see oscillations of the B, and then ratios of Amp, /Amg will allow a more precise 
determination of [V;,/V4|?, since some of the systematic errors will cancel out. 


As explained earlier, W bosons do not appear in lattice QCD calculations. The matrix 
element of the box diagram is calculated in lattice QCD by replacing it with the equivalent 
4-quark operator which appears in the effective (low-energy) weak Hamiltonian (Rosner, 
2002). Conventionally this matrix element for the B is set equal to (8/3) f2 M2 Bp, giving 
a definition of the parameter confusingly called Bg. Bg is the amount by which the matrix 
element differs from the result that would be obtained by saturating the Hy vertex of 
Figure 23 with the vacuum (comparing this to the right hand picture of Figure 8 we can 
see that this would be /2). Bg is generally expected to be roughly 1, and this explains 
why lattice calculations originally concentrated on calculating fg. To calculate Bg is 
harder, but is now being done. It requires, as for fg, a careful matching between the 
lattice and the continuum, and this has again been done to O(a,) so far. 


Recent world averages for the renormalisation-group-invariant definition of Bp in the 
quenched approximation have been given as (Ryan, 2002), (Bernard, 2001): 


e Bg, = 1.30(12)(13 


) 
e fp, Bp, =230(40)MeV 
+ Bg, / Bg, = 1.00(4) 


© fo, V By, fea Bn, = 1.15(6) 


A lot of the matching errors cancel out in the ratios between B, and B, so that the errors 
in these ratios are less than 1096. The ratio may also not be significantly affected by 
unquenching. 


Heavy-light mesons decay semi-leptonically through a diagram in which the heavy 
quark changes flavor, emitting a virtual W, and the other (spectator) quark in the meson 
combines with the new quark flavor to make a new meson. In this way B mesons can 
decay to D or D* mesons if b — c and to 7 or p mesons if b — u. In each case the 
appropriate CKM element appears at the current vertex in the three-point diagram (see 
Figure 9) and can therefore be determined by a comparison of the experimental exclusive 
rate to the theoretical one. The ratio V,,/V4, gives an important circular constraint in 
the unitarity triangle (see Figure 21). 


The calculation of the matrix element for B semi-leptonic decay on the lattice re- 
quires the calculation and simultaneous fitting of the 3-point function of Figure 9 and 
the appropriate 2-point functions necessary to isolate the matrix element. It is therefore 
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significantly harder than a simple 2-point calculation. In addition the matrix element 
depends on q?, the squared difference of 4-momenta between the initial and final meson. 
This can take a range of values, because the decay is a three-body one. The matrix el- 
ement can then be written as a combination of form-factors which are q? dependent, in 
contrast to the two-body leptonic decay which was parameterised by a single number, fg. 
For example the pseudoscalar to pseudoscalar transition (e.g. B to D or 1) proceeds only 
through the vector current and has two form factors, f, and fo: 


M2 - M3, , MZ - MP, 
(P'(p')|\VilP(p)) = J+ (a?) (p p), — Ug 3 + Jola EU h (38) 


The differential decay rate is proportional to the square of f} because the leptonic current 
L, coupling to the W has q*L, = 0 for massless leptons. The pseudoscalar to vector 
transition proceeds through both the vector and axial currents and has 5 form factors, 3 
of which appear in the decay rate. 


To explore different values of q? for semi-leptonic decay on the lattice it is easiest to 
insert different 3-momenta at the final meson and at the current, and then work out the 
resulting 3-momentum of the initial meson. We are restricted to values of 3-momentum 
allowed on the lattice, ie. the components of p have the form pja = nj;2n/L, where 
n; = 0,1,2... and L is the number of lattice sites in the j direction. The smallest 
non-zero value of p; is then 27/(La) where La is the physical size of the lattice in a 
spatial direction. A big physical volume is then required to achieve a fine discretisation 
of momentum space and avoid a large jump from one momentum to the next. In general 
results at higher momenta are much noisier than those at small momenta (this is for 
the same reason that excited states are noisier than ground states, discussed above) and 
calculations tend to be restricted to a few of the smallest possible momenta. Discretisation 
errors will also be larger at larger values of pa, so systematic errors will be higher. 


For the matrix element for B to D™) semi-leptonic decay it is useful to consider both 
the b quark and the c quark in the heavy quark limit. In that limit, as discussed above, 
the Lagrangian for heavy quarks becomes insensitive to the heavy quark spin or flavour 
(Buchalla, 2002). The light quark cloud in the meson cannot tell whether it is surrounding 
a b or a c quark or one whose spin is pointing parallel or anti-parallel to its spin. Thus 
the form factors for B > D and B — D* will become identical (or vanish) and the same 
as the B — B elastic form factor, provided they are viewed as a function of the right 
variable. This is not q? but v-v' where v is the 4-velocity (p,/m) of the initial meson and 
v' is the 4-velocity of the final meson. v-v' is often given the symbol w. In the notation 
of Equation 38 w = (M2 + MZ, — q?)/(2MpMp). The limit w = 1 is known as the 
‘zero-recoil’ limit because this corresponds to the kinematic point where the B meson 
at rest decays to, say, a D meson at rest and the decay products of the W come out 
back-to-back. This point has maximum q? = (Mp — Mp:)?. 


The B — B elastic form factor takes the form 
(B(v')|V,|B(v)) = Ma&Qw)(v +v’) (39) 


in the limit of infinite b quark mass, where E(w) = f,(q?), f. = 0. E(w) is known 
as the Isgur-Wise function. £(1) = 1 is an absolute normalisation in the continuum 
because by, b is a conserved current. The lattice current is not a conserved one (except 
for the NRQCD/static actions) but if we are interested only in the shape of £(w) we can 
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Figure 24. The B — B elastic form factor calculated in lattice QCD using NRQCD for 
the b quark and plotted versus w (Hashimoto, 1996). 


renormalise to match 1 at w = 1. There have been several calculations of the B — B form 
factor on the lattice, for various heavy quark masses. Figure 24 shows such a calculation 
using NRQCD with a mass close to that for the b quark (Hashimoto, 1996). 


'The interest in calculating the Isgur- Wise function is that, in the Heavy Quark Symme- 
try picture described above, it is also applicable to B -> D and B — D* decays. In these 
cases, however, there is an additional overall perturbative renormalisation because 5»,c is 
not a conserved current, and there are corrections which appear as differences of inverse 
powers of the b and c quark masses. For kinematic reasons, B — D* is experimentally 
easier to measure in the w — 1 region. The differential decay rate is 


SE = Wel? Kw) Qu) (40) 


where Va is the CKM element that we want to determine, K(w) is a kinematic factor and 
F is the form factor for the decay. Figure 25 shows results from the CLEO collaboration 
(CLEO, 2000) for F(u)|V4|. The lighter hashed curve is the result from the lattice shown 
in Figure 24 rescaled by a constant to match at w = 1. Given lattice results for B -> D* 
rather than B -> B, the constant required for rescaling would be |V.4| which would then 
be determined. 


In fact, a number of simplifications can be made to the lattice calculation at the 
w — 1 point and so it is currently better to perform a phenomenological extrapolation of 
the experimental data to w — 1 and divide the extrapolated result by the lattice result for 
F(1). The Fermilab group, using heavy relativistic (Fermilab) quarks and O(a,) matching 
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Figure 25. |Va|F(w) extracted from the experimental B — D* decay rate plotted as a 
function of w (CLEO, 2000). The shorter curve on 1<w<1.15 is a rescaled version of 
the curve in Figure 24. 


to the continuum, give the most precise result so far: 
0.024 40.017 
Fg p*(1) — 0.913*5.017--0.030 (41) 


in which the first error comes from statistics and fitting and the second from systematic 
errors, including the effect of using the quenched approximation (Hashimoto, 2001). The 
resulting value of |V4| extracted depends on which experiment's value for |V44,|.7 (1) is 
taken. Using an average result (Stone, 2002) of 37.8 + 1.4 x 107? gives a value for Va 
of 41.4 + 1.5 + 1.7 x 107? where the final theoretical error comes from adding the lattice 
errors in quadrature. The lattice and experimental errors are currently of about the same 
size. The lattice error can be improved further in an unquenched calculation with a higher 
order matching of the lattice current to the continuum. 


B — light (7, p) semi-leptonic decay is rather harder to calculate on the lattice. In 
many ways it is more important to do, however, because continuum techniques, such as 
HQET, can give very little useful input. One difficulty is that lattice systematic errors are 
smallest where the B and, say, the v lattice momenta are smallest, around the zero-recoil 
point discussed above, but there is very little experimental data there. Most experimental 
data occurs at relatively low q? values (q? < 16GeV?) when the zero recoil point has 
@ = gax = (Mg — Mr)? = 26GeV. A comparison of lattice results for the form factors 
for B — wm decay is shown in Figure 26 (Bernard, 2001). Different lattice results are 
shown covering a range of q?. The reason that some results are at smaller q? than others 
is because some use relativistic quarks (marked NPclover) at a mass around the c quark 
mass rather than the b. For reasons discussed earlier, none of the lattice calculations can 
be done at the physical u, d quark masses and so must be chirally extrapolated to that 
point. This is done in a different way by different groups and has led to very different 
final results, even though the intermediate data does not show very different behaviour 
(see Figure 26). A better understanding of how the chiral extrapolation should be done 
will be required before precise lattice results will be available. Good experimental results 
in the q? region that the lattice can reach will then allow a determination of Vi». 
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Figure 26. Lattice results for the form factors for B— decay (Bernard, 2001). 


5 Conclusions 


Lattice QCD has come a long way from the original calculations of the 1970s. The original 
idea that we could solve a simple discretisation of QCD numerically by ‘brute force’ has 
been replaced by a more sophisticated approach. Unfortunately, to the uninitiated, this 
can look like cookery. I have tried to describe some of the calculational and technical 
details so that non-practitioners feel able to make an informed judgement about lattice 
calculations, and see where progress will be made in the future. There is no doubt, for 
example, that precise lattice calculations are needed to obtain maximum benefit from the 
huge experimental investment in B physics. In the next few years such calculations will 
become possible, at least for some quantities, and this will mark the ‘coming of age’ of 
the lattice QCD approach at last. 
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1 Introduction 


The Standard Model predicts that the only way that CP is violated is through the 
Kobayashi-Maskawa mechanism [1]. Specifically, the source of CP violation is a sin- 
gle phase in the mixing matrix that describes the charged current weak interactions of 
quarks. In the introductory chapter, we briefly review the present evidence that supports 
the Kobayashi-Maskawa picture of CP violation, as well as the various arguments against 
this picture. 


1.4 Why believe the Kobayashi-Maskawa mechanism? 
Experiments have measured to date three independent CP violating observables: 


e Indirect CP violation in K — mr decays [2] and in K — mv decays is given by: 


£y = (2.28 + 0.02) x 107? e7/4, (1) 


e Direct CP violation in K — mm decays [3, 4, 5, 6, 7] is given by 


/ 


= = (1.72 + 0.18) x 1073. (2) 


» CP violation in B — Ks decay and other, related modes has been measured 
[8, 9, 10, 11, 12]: 
Gykg = 0.79 + 0.10. (3) 


All three measurements are consistent with the Kobayashi-Maskawa picture of CP 
violation. In particular, the two recent measurements of CP violation in B decays [11, 12] 
have provided the first precision test of CP violation in the Standard Model. Since 
the model has passed this test successfully, we are able, for the first time, to make the 
following statement: The Kobayashi-Maskawa phase is, very likely, the dominant source 
of CP violation in low-energy flavor-changing processes. 
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In contrast, various alternative scenarios of CP violation that have been phenomeno- 
logically viable for many years are now unambiguously excluded. Two important examples 
are the following: 


e The superweak framework [13], that is, the idea that CP violation is purely indirect, 
is excluded by the evidence that <'/e # 0. 


e Approximate CP, that is, the idea that all CP violating phases are small, is excluded 
by the evidence that ay, = O(1). 


The experimental result (3) and its implications for theory signify a new era in the study 
of CP violation. In this series of lectures we will explain these recent developments and 
their significance. 


1.2 Why doubt the Kobayashi-Maskawa mechanism? 
1.2.1 The baryon asymmetry of the Universe 


Cosmology shows that the Kobayashi-Maskawa phase cannot be the only source of CP 
violation: baryogenesis, that is, the history of matter and antimatter in the Universe, 
cannot be accounted for by the Kobayashi-Maskawa mechanism. 


To understand this statement, let us provisionally switch off all CP violation. Then, 
for every process that occurs in Nature, the corresponding CP conjugate process proceeds 
with a precisely equal rate. Let us further assume that the initial conditions are such 
that the number density of quarks and the number density of the matching antiquarks 
are equal. Then, CP invariance guarantees that the number densities remain equal to 
each other along the history of the Universe. In other words, the baryon asymmetry, 
n = (ng — ng)/ny, is guaranteed to remain zero. Two particularly significant processes 
are proton-antiproton annihilation and production. While the first would happen at any 
temperature, the latter is allowed only if the energy of the photons is large enough to 
produce a proton-antiproton pair. At high enough temperatures, T 2 2m,, annihilation 
and production will keep the protons and antiprotons in equilibrium and their number 
densities would be (precisely equal to each other and) similar to the photon number den- 
sity, ng = ng % n4. But at temperatures well below GeV, proton-antiproton production 
slows down until it practically stops. Since annihilation continues to take place, the num- 
ber densities of protons and antiprotons (remain equal to each other but) decrease, and 
at present there would be practically neither matter nor antimatter. This is, of course, 
inconsistent with observations. 


Now let us switch on CP violation. That allows a different rate for a process and its 
CP conjugate. Such a situation would have relevant consequences if two more conditions 
are met [14]: there is a departure from thermal equilibrium and baryon number can be vi- 
olated. When all three conditions are satisfied, a difference between the number densities 
of quarks and of antiquarks can be induced. We assume that the number of quarks be- 
comes slightly larger than the number of antiquarks. This scenario is called baryogenesis. 
At the electroweak phase transition (temperatures of order a few hundred GeV, t ~ 107!! 
seconds) baryon number violating processes become highly suppressed, and the baryon 
number cannot change any longer. The history of matter and antimatter in the Universe 
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proceeds along the same lines as described in the previous paragraph. In particular, at 
temperatures well below GeV the number densities of protons and antiprotons decrease. 
There is however an important difference: at some time, practically all antiprotons would 
disappear. But the small surplus of protons have no matching antiprotons to annihilate 
with. It remains there forever. The resulting picture of the present Universe is then as 
follows: there is no antimatter. There is a small amount of matter, with the present ratio 
(ng/ny)o reflecting the baryon asymmetry, [(n, — ng)/n,]ga, induced by baryogenesis. 
This picture is qualitatively consistent with observations. Thus we have good reasons to 
think that we understand the general mechanism of baryogenesis. 


The important point for our purposes is that baryogenesis is a consequence of CP 
violating processes. Therefore the present baryon number, which is accurately deduced 
from nucleosynthesis constraints (for a recent analysis, see [15]), 


PB (5.5 0.5) x 107%, (4) 


ny 
is essentially a CP violating observable! It can be added to the list of known CP violating 
observables, Equations (1), (2) and (3). Within a given model of CP violation, one can 
check for consistency between the data from cosmology, Equation (4), and those from 
laboratory experiments. 


The surprising point is that the Kobayashi-Maskawa mechanism for CP violation fails 
to account for (4). It predicts present baryon number density that is many orders of 
magnitude below the observed value [16, 17, 18]. This failure is independent of other 
aspects of the Standard Model: the suppression of ng/n., from CP violation is much too 
strong, even if the departure from thermal equilibrium is induced by mechanisms beyond 
the Standard Model. This situation allows us to make the following statement: There 
must exist sources of CP violation beyond the Kobayashi-Maskawa phase. 


Three important examples of viable models of baryogenesis are the following: 


1. GUT baryogenesis (for a recent review see [19]): the source of the baryon asym- 
metry is in CP violating decays of heavy bosons related to grand unified theories. 
In general, baryon number is not a conserved quantity in GUTs. Departure from 
thermal equilibrium is provided if the lifetime of the heavy boson is long enough that 
it decays when the temperature is well below its mass. The relevant CP violating 
parameters are not expected to affect low energy observables. 


2. Leptogenesis (for a recent review see [20]): lepton asymmetry is induced by CP 
violating decays of heavy fermions that are singlets of the Standard Model gauge 
group (sterile neutrinos). Departure from thermal equilibrium is provided if the 
lifetime of the heavy neutrino is long enough that it decays when the temperature 
is below its mass. B + L-violating processes are fast before the electroweak phase 
transition and convert the lepton asymmetry into a baryon asymmetry. The CP 
violating parameters may be related to CP violation in the mixing matrix for the 
light neutrinos (but this is a model dependent issue [21]). 


3. Electroweak baryogenesis (for a review see [22]): the source of baryon asymme- 
try is the interactions of top (anti)quarks with the Higgs field during the electroweak 
phase transition. CP violation is induced, for example, by supersymmetric inter- 
actions. Sphaleron configurations provide baryon number violating interactions. 
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Departure from thermal equilibrium is provided by the wall between the false vac- 
uum ((@) = 0) and the expanding bubble with the true vacuum, where electroweak 
symmetry is broken. 


1.2.3 The strong CP problem 


Non-perturbative QCD effects induce an additional term in the SM Lagrangian, 


focn va ppoa 
£g = 3952 mea F” F? . (5) 


This term violates CP. In particular, it induces a neutron electric dipole moment (EDM). 
The leading contribution in the chiral limit is given by [23] 


dy = GnNNGnNN 
N 4n?Mn 
& 5x 107!8 @gcp ecm, (6) 


In(My /m,) 


where My is the nucleon mass, and gryn (Gann) is the pseudoscalar coupling (CP- 
violating scalar coupling) of the pion to the nucleon. (The leading contribution in the 
large N, limit was calculated in the Skyrme model [24] and leads to a similar estimate.) 
The experimental bound on dy is given by 


dy < 6.3 x 107% ecm [25]. (7) 
It leads to the following bound on @gcp: 


foco x, 1077". (8) 


Since qcn arises from non-perturbative QCD effects, it is impossible to calculate it. 
Yet, there are good reasons to expect that these effects should yield Ogcp = O(1) (for a 
clear review of this subject, see [26]). Within the SM, a value as small as (8) is unnatural, 
since setting ÓQcp to zero does not add symmetry to the model. [In particular, as we 
will see below, CP is violated by 6xm = O(1).] Understanding why CP is so small in the 
strong interactions is the strong CP problem. 


It seems then that the strong CP problem is a clue to new physics. Among the solutions 
that have been proposed are a massless u-quark (for a review, see [27]), the Peccei-Quinn 
mechanism (28, 29] and spontaneous CP violation. As concerns the latter, it is interesting 
to note that in various string theory compactifications, CP is an exact gauge symmetry 
and must be spontaneously broken [30, 31]. 


1.2.3 New physics 


Another motivation to measure CP violating processes is that almost any extension of 
the Standard Model provides new sources of CP violation. These sources often allow for 
significant deviations from the Standard Model predictions. Moreover, various CP violat- 
ing observables can be calculated with very small hadronic uncertainties. Consequently, 
CP violation provides an excellent probe of new physics. 
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1.3 Will new CP violation be observed in experiments? 


The SM picture of CP violation is testable because the Kobayashi-Maskawa mechanism is 
unique and predictive. These features are mainly related to the fact that there is a single 
phase that is responsible to all CP violation. As a consequence of this situation, one finds 
two classes of tests: 


e Correlations: many independent CP violating observables are correlated within the 
SM. For example, the SM predicts that the CP asymmetries in B — V Kg and in 
B — Ks, which proceed through different quark decay processes, are equal to 
each other. Another important example is the strong SM correlation between CP 
violation in B > Ks and in K > nv. 


e Zeros: since the KM phase appears in flavor-changing, weak-interaction couplings of 
quarks, and only if all three generations are involved, many CP violating observables 
are predicted to be negligibly small. For example, the SM predicts no CP violation 
in the lepton sector, practically no CP violation in flavor-diagonal processes (i.e. a 
tiny electric dipole moment for the neutron) and very small CP violation in tree 
level D decays. 


In addition, some of the CP violating observables can be calculated with very small 
hadronic uncertainties. 


The strongest argument that new sources of CP violation must exist in Nature comes 
from baryogenesis. Whether the CP violation that is responsible for baryogenesis would 
be manifest in measurements of CP asymmetries in B decays depends on two issues: 


e The scale of the new CP violation: if the relevant scale is very high, such as in GUT 
baryogenesis or leptogenesis, the effects cannot be signalled in these measurements. 
To estimate the limit on the scale, the following three facts are relevant: First, the 
Standard Model contributions to CP asymmetries in B decays are O(1). Second, 
the expected experimental accuracy would reach in some cases the few percent 
level. Third, the contributions from new physics are expected to be suppressed by 
(Agw/Aup)". The conclusion is that, if the new source of CP violation is related 
to physics at Anp >> 1 TeV, it cannot be signalled in B decays. Only if the true 
mechanism is electroweak baryogenesis, can it potentially affect B decays. 


» The flavor dependence of the new CP violation: if it is flavor diagonal, its effects 
on B decays would be highly suppressed. It can still manifest itself in other, flavor 
diagonal CP violating observables, such as electric dipole moments. 


We conclude that new measurements of CP asymmetries in meson decays are particu- 
larly sensitive to new sources of CP violation that come from physics at (or below) the few 
TeV scale and that are related to flavor changing couplings. This is, for example, the case 
in certain supersymmetric models of baryogenesis [32, 33]. The search for electric dipole 
moments can reveal the existence of new flavor diagonal CP violation. Of course, there 
could be new flavor physics at the TeV scale that is not related to the baryon asymmetry 
and may give signals in B decays. The best motivated extension of the SM where this 
situation is likely is that of supersymmetry. We will discuss supersymmetric CP violation 
in the last chapter. 
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2 The Kobayashi-Maskawa mechanism 


2.1 Yukawa interactions are the source of CP violation 


A model of elementary particles and their interactions is defined by three ingredients: 
(1) the symmetries of the Lagrangian, (2) the representations of fermions and scalars and 
(3) the pattern of spontaneous symmetry breaking. For the Standard Model (SM) the 
corresponding ingredients are as follows: 


» The gauge symmetry is 


Gsm = SU(3)c x SU(2), x U(Y)y. (9) 


e There are three fermion generations, each consisting of five representations of Gay: 
Q3, 2) 1/6; URi(3, 142/35 Dhi(3; 1)-1/3; Liu, 2)-i2; Eni(l, 1)-1. (10) 


Our notation means that, for example, left-handed quarks, Q}, are triplets of 
SU (3)c, doublets of SU(2), and carry hypercharge Y = +1/6. The super-index 7 
denotes interaction eigenstates. The sub-index i = 1, 2,3 is the flavor (or generation) 
index. 


There is also a single scalar representation, 
$(1, 2) 41/2. (11) 


» The scalar ¢ assumes a vacuum expectation value (VEV), 


so that the gauge group is spontaneously broken, 


Gsm — SU (3)c x U(1)em.- (13) 


The Standard Model Lagrangian, Lsm, is the most general renormalisable Lagrangian 
that is consistent with the gauge symmetry (9). It can be divided to three parts: 


Lsm = Luinetic + Lutiggs + Lyukawa- (14) 


We consider each of these terms in turn, 


Kinetic terms 


For the kinetic terms, to maintain gauge invariance, one has to replace the derivative with 
a covariant derivative: 


D! = 0" + igsGË La + igWfT, + ig B"Y. (15) 
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Here G# are the eight gluon fields, W; the three weak interaction bosons and B" the 
single hypercharge boson. The L,'s are SU(3)c generators (the 3 x 3 Gell-Mann matrices 
A4/2 for triplets, 0 for singlets), the T;'s are SU (2), generators (the 2 x 2 Pauli matrices 
7/2 for doublets, 0 for singlets), and Y are the U(1)y charges. For example, for the 
left-handed quarks Q}, we have 


Lrinetic (QL) = iQLiy, (o + 59sGara + 59Wi n, + = P") Lo (16) 


while for the left-handed leptons L^, we have 
Crinaie( Lr) = iL (O Wen — ig B") LL. (17) 


These parts of the interaction Lagrangian are always CP conserving. 


The Higgs potential 


The Higgs potential, which describes the scalar self interactions, is given by: 
Luiggs = BOTO — A(d' d). (18) 


For the Standard Model scalar sector, where there is a single doublet, this part of the 
Lagrangian is also CP conserving. For an extended scalar sector, such as that of a two 
Higgs doublet model, Lyiggs can be CP violating. Even in the case that it is CP symmetric, 
it may lead to spontaneous CP violation. 


Quark Yukawa interactions 
The quark Yukawa interactions are given by 
— Casa = Ya QLODR; + Yg QL PU Ry + h.c. (19) 


This part of the Lagrangian is, in general, CP violating. More precisely, CP is violated if 
and only if [34] 

Im (det[Y ^Y, y"y"t]} z o. (20) 

An intuitive explanation of why CP violation is related to compler Yukawa couplings 


goes as follows. The hermiticity of the Lagrangian implies that Lyukawa has its terms in 
pairs of the form 


Yarn; + Y5 Vrid Uns. (21) 
A CP transformation exchanges the operators 


Dude; € Unjd Uri, (22) 


but leaves their coefficients, Y; and Y;*, unchanged. This means that CP is a symmetry 


ij? 
: Lov 
Of Lyukawa if Yiz = Yj; 
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Lepton Yukawa interactions 


The lepton Yukawa interactions are given by 
— LEIS = YLT GEL, + h.c.. (23) 


It leads, as we will see in the next section, to charged lepton masses but predicts massless 
neutrinos. Recent measurements of the fluxes of atmospheric and solar neutrinos provide 
evidence for neutrino masses. That means that Lsm cannot be a complete description 
of Nature. The simplest way to allow for neutrino masses is to add dimension-five (and, 
therefore, non-renormalisable) terms, consistent with the SM symmetry and particle con- 
tent: 
. Ys 

-c45 = M ibo + hee. (24) 
The parameter M has dimension of mass. The dimensionless couplings Yj; are symmetric 
(Y5 = Y3). We will refer to the SM extended to include the terms cin> of Equation (24) 
as the “extended SM” (ESM): 


LESM = Lkinetic + É Higgs + LYukawa + [o M (25) 


The inclusion of nonrenormalisable terms is equivalent to postulating that the SM is only 
a low energy effective theory, and that new physics appears at the scale M. 


How many independent CP violating parameters are there in £T ? Each of the 
two Yukawa matrices Y? (q = u,d) is 3 x 3 and complex. Consequently, there are 18 real 
and 18 imaginary parameters in these matrices. Not all of them are, however, physical. 
One can think of the quark Yukawa couplings as spurions that break a global symmetry, 


This means that there is freedom to remove 9 real and 17 imaginary parameters [the 
number of parameters in three 3 x 3 unitary matrices minus the phase related to U(1)g]. 
We conclude that there are 10 quark flavor parameters: 9 real ones and a single phase. 
This single phase is the source of CP violation in the quark sector. 


How many independent CP violating parameters are there in the lepton Yukawa in- 
teractions? The matrix Y* is a general complex 3 x 3 matrix and depends, therefore, on 9 
real and 9 imaginary parameters. The matrix Y" is symmetric and depends on 6 real and 
6 imaginary parameters. Not all of these 15 real and 15 imaginary parameters are physi- 
cal. One can think of the lepton Yukawa couplings as spurions that break (completely) a 
global symmetry, 


U(3), x U(3)g. (27) 
This means that 6 real and 12 imaginary parameters are not physical. We conclude that 


there are 12 lepton flavor parameters: 9 real ones and three phases. These three phases 
induce CP violation in the lepton sector. 
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2.2 CKM mixing is the (only!) source of CP violation in the 
quark sector 


Upon the replacement Re(¢°) > (v-- H?)/4/2 [see Equation (12)], the Yukawa interactions 
(19) give rise to mass terms: 


—£4, = (Ma)igD] Dh; + (Mi) ULUk; + h.c., (28) 
where v 
M, = y^ (29) 


and we decomposed the SU (2); quark doublets into their components: 
o= (Dh): (0 
The mass basis corresponds, by definition, to diagonal mass matrices. We can always 
find unitary matrices Vj, and V;g such that 
Vyr Ma Vir = Mes (q =u, d), (31) 
with Mäe diagonal and real. The quark mass eigenstates are then identified as 
qui = (Vou )ig Qty, qmi = (Vandijdh; (q= ud). (32) 


The charged current interactions for quarks [that is the interactions of the charged 
SU (2), gauge bosons WF = 4 (W; F iW;)], which in the interaction basis are described 
by (16), have a complicated form in the mass basis: 

g 


—Liys = vg Vat Var ides + h.c.. (33) 


The unitary 3 x 3 matrix, 
Voxm = Var V. (Vom V5 =1), (34) 


is the Cabibbo-Kobayashi-Maskawa (CKM) mizing matriz for quarks [35, 1]. A unitary 
3 x 3 matrix depends on nine parameters: three real angles and six phases. The form of 
this matrix is not unique and in fact there are two distinct freedoms which must be fixed. 


1. There is freedom in defining Vox in that we can permute between the various 
generations. This freedom is fixed by ordering the up quarks and the down quarks 
by their masses, i.e. (u1, u2, u3) > (u, c, t) and (di, do, d3) — (d, s, b). The elements 
of Vex are written as follows: 


Vud Vus Vab 
Vekm = | Vea Ves Væl. (35) 
Via Vis Ve 
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2. There is further freedom in the phase structure of Vcxm. Let us define P, (q = u,d) 
to be diagonal unitary (phase) matrices. Then, if instead of using Voz, | and Var for 
the rotation (32) to the mass basis, we use Vu and Vor, defined by Vir = = P, v VaL 
and Vir = = P,Vor, we still maintain a legitimate mass basis since MÈ” remains 
unchanged by such transformations. However, Vcxm does change: 


Veku > B,Vckw Pj. (36) 


This freedom is fixed by demanding that Vcyy has the minimal number of phases. In 
the three generation case Vcky has a single phase. (There are five phase differences 
between the elements of P, and P, and, therefore, five of the six phases in the CKM 
matrix can be removed.) This is the Kobayashi-Maskawa phase Óókw which is the 
single source of CP violation in the quark sector of the Standard Model [1]. 


As a result of the fact that Vexm is not diagonal, the WF gauge bosons couple to 
quark (mass eigenstates) of different generations. Within the Standard Model, this is the 
only source of flavor changing quark interactions. 


2.3 The three phases in the MNS mixing matrix 


The leptonic Yukawa interactions (23) and (24) give rise to mass terms: 


—Liy = (M.)gel;eh; + (Mov lvl; + h.c., (37) 
where 
M.= Y°, M,=2-y" (38) 
e 7T /2 , v 2M , 
and we decomposed the SU (2), lepton doublets into their components: 
vi. 
LI, = (5) (39) 


We can always find unitary matrices Vez and V, such that 
V.L M. Mj Vj, = diag(m2, m2, m2), | V, MiM, Vj = diag(m?, m], m3). (40) 


The charged current interactions for leptons, which in the interaction basis are described 
by (17), have the following form in the mass basis: 


ct, = Se ^ (V,L V ivy Wg + h.c.. (41) 


The unitary 3 x 3 matrix, 
Vuns = VeL Vj, (42) 
is the Maki-Nakagawa-Sakata (MNS) mizing matriz for leptons [36]. As for the CKM 


matrix, the form of the MNS matrix is not unique: there are the same two freedoms but 
they are fixed by different conventions: 
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1. We can permute between the various generations. This freedom is usually fixed in 
the following way. We order the charged leptons by their masses, i.e. (e1,€5, 63) > 
(e, u, T). For the neutrinos, one takes into account that the interpretation of atmo- 
spheric and solar neutrino data in terms of two-neutrino oscillations implies that 
Amir > Amo. It follows that one of the neutrino mass eigenstates is separated 
in its mass from the other two, which have a smaller mass difference. The conven- 
tion is to denote this separated state by v3. For the remaining two neutrinos, 4 
and m, the convention is to call the heavier state v. In other words, the three mass 
eigenstates are defined by the following conventions: 


[Amil > [Amil Am3 > 0. (43) 


Note in particular that v can be either heavier or lighter than v4». The elements 
of Vans are written as follows: 


Ver Va Ves 
Vans = Va Vaz Via . (44) 
Va Va Va 


2. There is further freedom in the phase structure of Vus. (In the MNS paper [36] 
there is no reference to CP violation.) One can change the charged lepton mass basis 
by the transformation e(j gy — CCL, Ri = (Pj)ueu,,gy, Where P, is a phase matrix. 
There is, however, no similar freedom to redefine the neutrino mass eigenstates: 
From Equation (37) one learns that a transformation v; — P,v; will introduce 
phases into the diagonal mass matrix. This is related to the Majorana nature of 
neutrino masses, assumed in Equation (24). The allowed transformation modifies 
Vus: 

Vuns > PeVmns. (45) 
This freedom is fixed by demanding that Vyns will have the minimal number of 
phases. Out of six phases of a unitary 3 x 3 matrix, the multiplication by P, can be 
used to remove three. We conclude that the three generation Vmns matrix has three 
phases. One is the analog of the Kobayashi-Maskawa phase: it is the only source 
of CP violation in processes that conserve lepton number, such as neutrino flavor 
oscillations. The other two phases can affect lepton number changing processes. 


With Vuns Æ 1, the W+ gauge bosons couple to lepton (mass eigenstates) of different 
generations. Within the ESM, this is the only source of flavor changing lepton interactions. 


2.4 The flavor parameters 


Examining the quark mass basis, one can easily identify the flavor parameters. In the 
quark sector, we have six quark masses and four mixing parameters: three mixing angles 
and a single phase. The fact that there are only three real and one imaginary physical 
parameters in Vcxm can be made manifest by choosing an explicit parameterisation. For 


example, the standard parameterisation [37], used by the Particle Data Group, is given 
by 
€12C13 $12C13 siye ? 


ið ið 
Vokm = | 781223 — €12823813€ €12C23 — 812523813€ $23€13 |, (46) 


5 » 
812823 — €12€23813€"  —012823 — 812€23813€" — €23€13 
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where ci = cos 6j; and sj; = sin6;j. The three sin 6;; are the three real mixing parameters 
while ó is the Kobayashi-Maskawa phase. Another, very useful, example is the Wolfenstein 
parameterisation, where the four mixing parameters are (A, A, p, ) with A = |V,,| = 0.22 
playing the role of an expansion parameter and 7 representing the CP violating phase 
[38]: 


2 
1-4 A — AM(p- in) 
Voxm = E 1- x AX + O(A*). (47) 
AX(-p-ip -AX 1 


Various parameterisations differ in the way that the freedom of phase rotation, Equa- 
tion (36), is used to leave a single phase in Vckw. One can define, however, a CP violating 
quantity in Verm that is independent of the parameterisation [34]. This quantity, Jckw; 
is defined through 


3 
Im[Vij Vie Vir Ves | = Jckd Y. 6km6im — (5j, E 1 1, 2,3). (48) 


m,n-l 
In terms of the explicit parameterisations given above, we have 


2 B 6 42 
JckM = €12€23C1 3912523513 sind x À A 7. (49) 


It is interesting to translate the condition (20) to the language of the flavor parameters 
in the mass basis. One finds that the following is a necessary and sufficient condition for 
CP violation in the quark sector of the SM: 


Am? Am? Am, Am Am; Am? Jckw #0. (50) 


Here 
Am; = m} — mi. (51) 


Equation (50) puts the following requirements on the SM in order that it violates CP: 


e Within each quark sector, there should be no mass degeneracy; 
» None of the three mixing angles should be zero or 7/2; 


» The phase should be neither 0 nor v. 


For the lepton sector of the ESM, the flavor parameters are the six lepton masses, 
and six mixing parameters: three mixing angles and three phases. One can parameterise 
Vuws in a convenient way by factorising it into Vung = VP. Here P is a diagonal 
unitary matrix that depends on two phases, e.g. P = diag(e'?',e'?*, 1), while V can 
be parameterised in the same way as (46). The advantage of this parameterisation is 
that for the purpose of analyzing lepton number conserving processes and, in particular, 
neutrino flavor oscillations, the parameters of P are usually irrelevant and one can use the 
same Chau-Keung parameterisation as is being used for Vckw. (An alternative way to 
understand these statements is to use a single-phase mixing matrix and put the extra two 
phases in the neutrino mass matrix. Then it is obvious that the effects of these ‘Majorana- 
phases’ always appear in conjunction with a factor of the Majorana mass that is a lepton 
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number violating parameter.) On the other hand, the Wolfenstein parameterisation (47) 
is inappropriate for the lepton sector: it assumes |Vo3| « |Via| « 1, which does not hold 
here. 

In order that the CP violating phase ô in V would be physically meaningful, i.e. there 
would be CP violation that is not related to lepton number violation, a condition similar 
o (50) should hold: 


Am? Am2, Am?, Ami, AM3 AM3 Juns #0. (52) 


2.5 The unitarity triangles 


A very useful concept is that of the unitarity triangles. We will focus on the quark sector, 
but analogous triangles can be defined in the lepton sector. The unitarity of the CKM 
matrix leads to various relations among the matrix elements, e.g. 


VuaVus + VeaVes + ViaVis = 0, (53) 
Vas Vab + Vos V + Vis Vg = 9, (54) 
VuaVus + Voa Va +VaVa = 0 (55) 


Each of these three relations requires the sum of three complex quantities to vanish and 
so can be geometrically represented in the complex plane as a triangle. These are “the 
unitarity triangles”, although the term “unitarity triangle” is usually reserved for the 
relation (55) only. It is a surprising feature of the CKM matrix that all unitarity triangles 
are equal in area: the area of each unitarity triangle equals |Jcxm|/2 while the sign of 
JckM gives the direction of the complex vectors around the triangles. 


The rescaled unitarity triangle is derived from (55) by (a) choosing a phase convention 
such that (Va V3) is real, and (b) dividing the lengths of all sides by |V, V3]. Step 
(a) aligns one side of the triangle with the real axis, and step (b) makes the length 
of this side 1. The form of the triangle is unchanged. Two vertices of the rescaled 
unitarity triangle are thus fixed at (0,0) and (1,0). The coordinates of the remaining 
vertex correspond to the Wolfenstein parameters (p,n). The area of the rescaled unitarity 
triangle is |n|/2. 


Depicting the rescaled unitarity triangle in the (p,7) plane, the lengths of the two 
complex sides are 


Vud Vub 2173 Via Vi 
EE = Q R= = y (1 - p)? 2, 96 
Ry vb vot (= |TV, (1—p)? +n (56) 
The three angles of the unitarity triangle are defined as follows [39, 40]: 
vali E as 
a=arg|-—*=|, @=arg|- , ¥y=arg . (57) 
| Vaa Vab Va V Va Va 


They are physical quantities and can be independently measured by CP asymmetries in B 
decays [41, 42, 43, 44, 45]. It is also useful to define the two small angles of the unitarity 
triangles (54) and (53): 


V,, Ve Vi 
B, = arg E e| , Bx = arg E es al . (58) 
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Figure 1. Present Standard Model constraints and the result from the global CKM fit. 


To make predictions for future measurements of CP violating observables, we need to 
find the allowed ranges for the CKM phases. There are three ways to determine the CKM 
parameters (see e.g. [46]): 


» Direct measurements are related to SM tree level processes. At present, we have 
direct measurements of |Vug|, |Vus|; |Vusl, [Veal [Ves], |Væl and |V] 


e CKM Unitarity (VikmVckm = 1) relates the various matrix elements. At present, 
these relations are useful to constrain {Via|, |Visl, |Va| and [Ves]. 


e Indirect measurements are related to SM loop processes. At present, we con- 
strain in this way |VisVia| (from Amg and Amp,) and ókw or, equivalently, 7 or 8 
(from £x and ayx,). 


When all available data are taken into account, one finds [47]: 


A = 0.2221 0.0021, A = 0.827 + 0.058, (59) 
p = 0.23+0.11, n= 0.37 + 0.08, (60) 
sin28 = 0.7740.08, sin2a — —0.21 +0.56, 0.43 < sin? y< 0.91. — (61) 


Of course, there are correlations between the various parameters. The full information in 
the (p, 9) plane is given in Figure 1 [47]. 
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2.6 Uniqueness of the standard model picture of CP violation 


In the previous subsections, we have learnt several features of CP violation as explained 
by the Standard Model. It is important to understand that various reasonable (and 
often well-motivated) extensions of the SM provide examples where some or all of these 
features do not hold. Furthermore, until a few years ago, none of the special features 
of the Kobayashi-Maskawa mechanism of CP violation was experimentally tested. This 
situation has dramatically changed recently. Let us survey some of the SM features, how 
they can be modified with new physics, and whether experiment has shed light on these 
questions. 


1. ókw is the only source of CP violation in meson decays. This is arguably the most 
unique feature of the SM and gives the model a strong predictive power. It is violated 
in almost any low-energy extension. For example, in the supersymmetric extension 
of the SM there are 44 physical CP violating phases, many of which affect meson 
decays. The measured value of ayx, is consistent with the correlation between K 
and B decays that is predicted by the SM. It is therefore very likely that ókw is 
indeed the dominant source of CP violation in meson decays. 


2. CP violation is small in K — am decays because of flavor suppression and not 
because CP is an approrimate symmetry. In many (though certainly not all) super- 
symmetric models, the flavor suppression is too mild, or entirely ineffective, requiring 
approximate CP to hold. The measurement of ayx, = O(1) confirms that not all 
CP violating phases are small. 


3. CP violation appears in both AF — 1 (decay) and AF — 2 (mixing) amplitudes. 
Superweak models suggest that CP is violated only in mixing amplitudes. The 
measurement of e'/e confirms that there is CP violation in AS = 1 processes. 


4. CP is not violated in the lepton sector. Models that allow for neutrino masses, such 
as the ESM framework presented above, predict CP violation in leptonic charged 
current interactions. The data from neutrino oscillation experiments makes it very 
likely that charged current weak interactions violate CP also in the lepton sector. 


5. CP violation appears only in the charged current weak interactions and in conjunc- 
tion with flavor changing processes. Here both various extensions of the SM (such 
as supersymmetry) and non-perturbative effects within the SM (@gcp) allow for CP 
violation in other types of interactions and in flavor-diagonal processes. In particu- 
lar, it is difficult to avoid flavor-diagonal phases in the supersymmetric framework. 
The fact that no electric dipole moment has been measured yet poses difficulties 
to many models with diagonal CP violation (and, of course, is responsible to the 
strong CP problem within the SM). 


6. CP is explicitly broken. In various extensions of the scalar sector, it is possible 
to achieve spontaneous CP violation. It will be very difficult to test this question 
experimentally. 


This situation, where the Standard Model has a very unique and predictive description 
of CP violation and the number of experimentally measured CP violating observables is 
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very limited (eg, &'/e and axs), is the basis for the strong interest, experimental and 
theoretical, in CP violation. There are two types of unambiguous tests concerning CP 
violation in the Standard Model: First, since there is a single source of CP violation, 
all observables are correlated with each other. For example, the CP asymmetries in 
B — Kg and in K — mv» are strongly correlated [48, 49, 50]. Second, since CP 
violation is restricted to flavor changing fermion processes, it is predicted to be highly 
suppressed in the lepton sector and practically vanish in flavor diagonal processes. For 
example, the transverse lepton polarisation in semileptonic meson decays, CP violation 
in t£ production, and (assuming @gcp = 0) the electric dipole moment of the neutron are 
all predicted to be orders of magnitude below the (present and near future) experimental 
sensitivity. We conclude that it is very important to search for CP violation in many 
different systems. 


3 Meson decays 


In the previous section, we explained how CP violation arises in the Standard Model. In 
the next three sections, we would like to understand the implications of this theory for 
the phenomenology of CP violation in K, D and B decays. To do so, we first present 
a model-independent analysis of CP violation in meson decays. We distinguish between 
three different types of CP violation in meson decays: 


» CP violation in mixing, which occurs when the two neutral mass eigenstate admix- 
tures cannot be chosen to be CP-eigenstates; 


« CP violation in decay, which occurs in both charged and neutral decays, when the 
amplitude for a decay and its CP-conjugate process have different magnitudes; 


» CP violation in the interference of decays with and without mixing, which occurs 
in decays into final states that are common to B® and B^. 


3.1 Notations and formalism 


'To define these three types and to discuss their theoretical calculation and experimental 
measurement, we first introduce some notation and formalism. We refer specifically to B 
meson mixing and decays, but most of our discussion applies equally well to K, B, and 
D mesons. 


A B? meson is made from a b-type antiquark and an d-type quark, while the B? meson 
is made from a b-type quark and an d-type antiquark. Our phase convention for the CP 
transformation law of the neutral B mesons is defined by 


CP|B)) = »g|B?, CP|B°) = w$|B°), (|upg| = 1). (62) 
Physical observables do not depend on the phase factor wp. 


The light, Br, and heavy, By, mass eigenstates can be written as linear combinations 
of B? and B°: 


|B) 
|Bu) 


p|B^) + q|B"), 
p|B^) — q|B^), (63) 
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with 
la? + |p|? = 1. (64) 


The mass difference Amp and the width difference AT g are defined as follows: 
Am = My — Mr, Ar = Tg- rz. (65) 


The average mass and width are given by 


My +M Ta +r 
pe wit pa HHL (66) 
2 2 
It is useful to define dimensionless ratios z and y: 
re Am | AT (67 
"T^ "sw 


The time evolution of the mass eigenstates is simple: 


|Bu(t)) e Mate Tut? | By), 
|Br(t)) = eMe TH By, (68) 


The time evolution of the strong interaction eigenstates is complicated and obeys a 


Schródinger-like equation, 
4 /B i B 
i (B) - (9 - s) (8): (69) 


where M and T are 2 x 2 Hermitian matrices. The off-diagonal terms in these matrices, 
My. and Tj, are particularly important in the discussion of mixing and CP violation. 
Mis is the dispersive part of the transition amplitude from B? to B°, while l'i; is the 
absorptive part of that amplitude. Solving the eigenvalue equation gives 


1 

(Am)? — (Ary = (4|Myo|? — |Py2|”), AmAT = 4Re(Mi;T 75), (70) 
q__ 2M$-ih — Am- Sar mn) 
p Am — LAT 2M2 — iTi 


In the B system, |li»| < |Mi2| (see discussion below), and then, to leading order in 
[D12/Mis|, Equations (70) and (71) can be written as 


2Re( My T 
Amp = 2|Mis], AT, = A (72) 
a Mi (73) 
p [Mis 


To discuss CP violation in mixing, it is useful to write Equation (71) to first order in 
[T12/ Mia| [rather than to zeroth order as in (73)]: 


oe [i-am]. (74) 
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To discuss CP violation in decay, we need to consider decay amplitudes. The CP 
transformation law for a final state f is 


CPI) wif)  CPIf)swrlf (lel =1)- (75) 


For a final CP eigenstate f = f = fcp, the phase factor wy is replaced by rg, = +1, the 
CP eigenvalue of the final state. We define the decay amplitudes A; and A; according to 


Ay = (f[HalB°), |. A; = (f|HalB°), (76) 
where Ha is the decay Hamiltonian. 


CP relates Ay and A j. There are two types of phases that may appear in Ay and A F 
Complex parameters in any Lagrangian term that contributes to the amplitude will appear 
in complex conjugate form in the CP-conjugate amplitude. Thus their phases appear in 
A, and A 7 with opposite signs. In the SM these phases occur only in the mixing matrices 
that parameterise the charged current weak interactions, hence these are often called 
“weak phases”. The weak phase of any single term is convention dependent. However the 
difference between the weak phases in two different terms in A; is convention independent 
because the phase rotations of the initial and final states are the same for every term. 
A second type of phase can appear in scattering or decay amplitudes even when the 
Lagrangian is real. Such phases do not violate CP and they appear in Ay and A j with 
the same sign. Their origin is the possible contribution from intermediate on-shell states 
in the decay process, that is an absorptive part of an amplitude that has contributions 
from coupled channels. Usually the dominant re-scattering is due to strong interactions 
and hence the designation "strong phases" for the phase shifts so induced. Again only the 
relative strong phases of different terms in a scattering amplitude have physical content, 
an overall phase rotation of the entire amplitude has no physical consequences. Thus it 
is useful to write each contribution to A in three parts: its magnitude A;; its weak phase 
term €'9*:; and its strong phase term e'*. Then, if several amplitudes contribute to B > f, 
we have "n 

E _ |X; Ave i (77) 
EM A; y^. A;eib--e) 


To discuss CP violation in the interference of decays with and without mixing, we 
introduce a complex quantity A, defined by 
q Ay 
Ap = 7. 78 
57 pA (78) 
We further define the CP transformation law for the quark fields in the Hamiltonian 
(a careful treatment of CP conventions can be found in [51)): 
q > wd, F > w,q, (lw =1). (79) 
The effective Hamiltonian that is relevant to Mj; is of the form 
Hae xx e*2ión [à _ o] n [b^ — DE (80) 


where 2óg is a CP violating (weak) phase. (We use the SM V— A amplitude, but the 
results can be generalised to any Dirac structure.) For the B system, where |I| < | Mis], 
this leads to 

E = wgw wae "98, (81) 
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(We implicitly assumed that the vacuum insertion approximation gives the correct sign 
for Mis. In general, there is a sign(Bg) factor on the right hand side of Equation (81) 
[52].) To understand the phase structure of decay amplitudes, we take as an example the 
b > qqd decay (q = u or c). The decay Hamiltonian is of the form 


Ha x et% [a^ (1 — 45) ¢] [by,(1 — y5)q] + € 9r "(1 — 59) [dv — vs)q], — (82) 


where ó; is the appropriate weak phase. (Again, for simplicity we use a V —A structure, 
but the results hold for any Dirac structure.) Then 


= Wywpupwge MOF (83) 


m 


Equations (81) and (83) together imply that for a final CP eigenstate, 


cp = Nope Rtn), (84) 


3.2 The three types of CP violation in meson decays 
3.2.1 CP violation in mixing 


la/p| # 1. (85) 


This type of CP violation results from the mass eigenstates being different from the CP 
eigenstates, and requires a relative phase between Mj» and ['jo. For the neutral B system, 
this effect could be observed through the asymmetries in semileptonic decays: 


T(B& (t) > €vX) — P(Bbays(t) > t7 vX) 


= , 8 
“SL T(B&, (0) > €vX) + T(BSL) > €vX) (86) 
In terms of q and p, fol 
1 — lq/p 
-ICMP 87 
esL T+ |a/pl! (87) 


CP violation in mixing has been observed in the neutral K system (Re ex # 0). 


In the neutral B system, the effect is expected to be small, x, O(107?). The reason is 
that, model independently, the effect cannot be larger than O(AT'g/ Amp). The difference 
in width is produced by decay channels common to B? and B?. The branching ratios for 
such channels are at or below the level of 107?. Since various channels contribute with 
differing signs, one expects that their sum does not exceed the individual level. Hence, 
we can safely assume that AF g/Tg = O(107?). On the other hand, it is experimentally 
known that Ampg/Ug £ 0.7. 


To calculate ası, we use (87) and (74), and get: 
as, = Im(L12/Mj;). (88) 


To predict it in a given model, one needs to calculate Mis and L12. This involves large 
hadronic uncertainties, in particular in the hadronisation models for T42. 
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3.2.2 CP violation in decay 


|A7/Aj| #1. (89) 
This appears as a result of interference among various terms in the decay amplitude, and 


will not occur unless at least two terms have different weak phases and different strong 
phases. CP asymmetries in charged B decays, 


T(B*—f*)-nB > f`) 


OS BES ET > f) (90) 
are purely an effect of CP violation in decay. In terms of the decay amplitudes, 
1-|Ar-/Ae Tl 
= os 1 
H* V4 [Ay Ap? (91) 


CP violation in decay has been observed in the neutral K system (Re & # 0). 


To calculate aj+, we use (91) and (77). For simplicity, we consider decays with con- 
tributions from two weak phases and with A» < A,. We get: 


ape = —2(As/ Ai) sin(ó» — 6i) sin(ó» — di). (92) 


The magnitude and strong phase of any amplitude involve long distance strong interaction 
physics, and our ability to calculate these from first principles is limited. Thus quantities 
that depend only on the weak phases are much cleaner than those that require knowledge 
of the relative magnitudes or strong phases of various amplitude contributions, such as 
CP violation in decay. 


3.2.3 CP violation in interference between decays with and without mixing 


Im Ag, # 0. (93) 


This effect is the result of interference between a direct decay amplitude and a first- 
mix-then-decay path to the same final state. For the neutral B system, the effect can 
be observed by comparing decays into final CP eigenstates of a time-evolving neutral B 
state that begins at time zero as B? to those of the state that begins as B°: 


(Bs. (t) > fcr) — (Boys (t) > fer) 


ay.p(t) = = . 94 
e = TR. (0) > fee) + T(Boya(@) > for) (90 
This time dependent asymmetry is given, in general, bv 
1 — [Aje P? 2ImA; . 
Gg. (1) = ———- cos(Ampt) + ——-—5- sin(Amgt). 5 
fest ) m TUNE os( mg ) 1+ LUND sin( mp ) (95) 
In decays with |A/,p| = 1, (93) is the only contributing effect: 
Gg, (t) = IMA fp sin(Ampt). (96) 
We often use om) 
m 
Qfop = fce (97) 


1+ IN few |? 
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CP violation in the interference of decays with and without mixing has been observed 
for the neutral K system (Im £x # 0) and for the neutral B system (auk, # 0). In the 
latter, it is an effect of O(1). For such cases, the contribution from CP violation in mixing 
is clearly negligible. For decays that are dominated by a single CP violating phase (for 
example, B > y Ks and K; — 7°vi), so that the contribution from CP violation in decay 
is also negligible, a;., is cleanly interpreted in terms of purely electroweak parameters. 
Explicitly, ImA;,, gives the relative phase between the B — B mixing amplitude and the 
relevant decay amplitudes [see Equation (84)]: 


ImAjop = —N for sin[2(ġg + ór)]. (98) 


3.2.4 Direct and indirect CP violation 


The terms indirect CP violation and direct CP violation are commonly used in the lit- 
erature. While various authors use these terms with different meanings, the most useful 
definition is the following: 


» Indirect CP violation refers to CP violation in meson decays where the CP 
violating phases can all be chosen to appear in AF = 2 (mixing) amplitudes. 


» Direct CP violation refers to CP violation in meson decays where some CP 
violating phases necessarily appear in AF — 1 (decay) amplitudes. 


Examining Equations (85) and (71), we learn that CP violation in mixing is a manifes- 
tation of indirect CP violation. Examining Equations (89) and (76), we learn that CP 
violation in decay is a manifestation of direct CP violation. Examining Equations (93) 
and (78), we learn that the situation concerning CP violation in the interference of de- 
cays with and without mixing is more subtle. For any single measurement of ImA,; ¥ 0, 
the relevant CP violating phase can be chosen by convention to reside in the AF — 2 
amplitude [ó; = 0, óg # 0 in the notation of Equation (84)], and then we would call 
it indirect CP violation. Consider, however, the CP asymmetries for two different final 
CP eigenstates (for the same decaying meson), f, and fj. Then, a non-zero difference 
between ImA;, and ImA;, requires that there exists CP violation in AF = 1 processes 
(6r, — r # 0), namely direct CP violation. 


Experimentally, both direct and indirect CP violation have been established. Below 
we will see that £ signifies indirect CP violation while £^, signifies direct CP violation. 


Theoretically, most models of CP violation (including the Standard Model) have pre- 
dicted that both types of CP violation exist. There is, however, one class of models, that 
is superweak models, that predict only indirect CP violation. The measurement of £% # 0 
has excluded this class of models. 


4 K decays 


Measurements of CP violation have played an enormous role in particle physics. First, 
the measurement of £x in 1964 provided the first evidence that CP is not a symmetry of 
Nature. This discovery revolutionised the thinking of particle physicists and was essential 
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for understanding baryogenesis. Second, the measurement in 1988 of e% provided the first 
evidence for direct CP violation and excluded the superweak scenario. In the future, the 
search for CP violation in K — vv decays will add significantly to our understanding of 
CP violation. 


4. ex and e& 


Historically, a different language from the one used by us has been employed to describe 
CP violation in K — 77 and K — m£v decays. In this section we ‘translate’ the language 
of ex and &', to our notation. Doing so will make it easy to understand which type of 
CP violation is related to each quantity. 


The two CP violating quantities measured in neutral K decays are 


(n°n°|H| Kz) (ntr HIE) 
= ———————— — = rn 000 e 9 
707 OHK) "T Gata PAIK) (99) 
Define, for (17) = (00) or (—), 
ij 0 A inj [^o q Ay 
Ay = (m |H|K°), Ag = GP HIKT), Ag = [= . (100) 
P) y Ai 
Then 1-2 1—A 
— Aoo — À- 
= — = . 1 1 
Noo 14 Aoo’ n+ TFAL (101) 


The noo and n+- parameters get contributions from CP violation in mixing (|(q/p)|x 4 1) 
and from the interference of decays with and without mixing (ImA;; # 0) at O(1073) and 
from CP violation in decay (|A;j/A;j| # 1) at O(10 9). 


There are two isospin channels in K — 77 leading to final (27);-9 and (27),;—? states: 


(2| = Vi/3(m)i-a| — 2/300) 1-21, 
(ata | = V2/3(mm)isl + 1/3) al. (102) 


The fact that there are two strong phases allows for CP violation in decay. The possible 
effects are, however, small (on top of the smallness of the relevant CP violating phases) 
because the final J = 0 state is dominant (this is the AJ = 1/2 rule). Define 
— 0 A. jo _{@ EY 
A, = (n1), [H|K9), Ar = (ar) HIK’), A; = (= =}. (103) 
p K ET 


Experimentally, | A5/ Ao| = 1/20. Instead of nog and n4- we may define two combinations, 
€x and &&, in such a way that the possible effects of direct (indirect) CP violation are 
isolated into && (x). 


The experimental definition of the £x parameter is 


£k = z (noo + 2n+-). (104) 


1 
3 
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The experimental value is given by Equation (1). To zeroth order in Aj/Ao, we have 
Noo = N+- = Ey. However, the specific combination (104) is chosen in such a way that 
the following relation holds to first order in A2/A: 


ol- 


Tay (105) 


EK 
Since, by definition, only one strong channel contributes to Ap, there is indeed no CP 
violation in decay in (105). It is simple to show that Re ex # 0 is a manifestation of CP 
violation in mixing while Im £x # 0 is a manifestation of CP violation in the interference 
between decays with and without mixing. Since experimentally argex œ% 7/4, the two 
contributions are comparable. It is also clear that €x # 0 is a manifestation of indirect 
CP violation: it could be described entirely in terms of a CP violating phase in the Mi; 
amplitude. 


The experimental definition of the €; parameter is 


, 
EK 


(1+- — moo). (106) 


The quantity that is actually measured in experiment is 


= 6Re(e'/e). (107) 
n+- 


The world average is given in Equation (2). The theoretical expression is 


1 


gv -= Ay-). (108) 


ek & 
Obviously, any type of CP violation which is independent of the final state does not 
contribute to c. Consequently, there is no contribution from CP violation in mixing to 
(108). It is simple to show that Re c # 0 is a manifestation of CP violation in decay 
while Im c% # 0 is a manifestation of CP violation in the interference between decays 
with and without mixing. Following our explanations in the previous section, we learn 
that c #0 is a manifestation of direct CP violation: it requires ¢2 — o # 0 [where $; 
is the CP violating phase in the A; amplitude defined in (103)]. 


4.1.1 The ex parameter in the Standard Model 


An approximate expression for £x, that is convenient for calculating it, is given by 


e^ Im Mis 
= . 109 
EK V2 Amx ( ) 


A few points concerning this expression are worth emphasising: 


e Equation (109) is given in a specific phase convention, where Az is real. Within the 
SM, this is a phase convention where V,,,V,* is real, a condition fulfilled in both the 
standard parameterisation of Equation (46) and the Wolfenstein parameterisation 
of Equation (47). 
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e The phase of 7/4 is approximate. It is determined by hadronic parameters and 
therefore is independent of the electroweak model. Specifically, 


arg(ek) ~ arctan(-2Amk/AT x) e 7/4. (110) 


» A term of order gimA Re M < 0.02 is neglected when (109) is derived. 
e olm 12 


e There is a large hadronic uncertainty in the calculation of Mj? coming from long 
distance contributions. There are, however, good reasons to believe that the long 
distance contributions are important in Re Mi; (where they could be even com- 
parable to the short distance contributions), but negligible in Im Mis. To avoid 
this uncertainty, one uses Im Mijs/ Amy with the experimentally measured value of 
Amy, instead of ImMj2/2Re Mis with the theoretically calculated value of Re Mi. 


» The matrix element (K°|(8d)y_4(8d)y—4|K°) is yet another source of hadronic un- 
certainty. If both Im Mi» and Re Mi? were dominated by short distance contribu- 
tions, one would use the ratio ImMj,2/Re Mi? where the matrix element cancels out. 
However, as explained above, this is not the case. 


Within the Standard Model, Im M, is accounted for by box diagrams. We follow here 
the notations of reference [53], where precise definitions, numerical values and appropriate 
references are given. One obtains: 


er = e ^C, ByIm(V V4) (Re(VA Vm Sz) — neSo(tes1)} — Re Vig) moo) , 
(111) 
where C, = Cfi i is a well known parameter, the n; are QCD correction factors, 


6V 207A 


So is a kinematic face “and By is the ratio between the matrix element of the four quark 
operator and its value in the vacuum insertion approximation. 


We would like to emphasise the following points: 


» CP violation was discovered through the measurement of ey. Hence this measure- 
ment played a significant role in the history of particle physics. 


» For a long time, €x has been the only measured CP violating parameter. Roughly 
speaking, this measurement set the value of ókw (and, by requiring ôkm = O(1), 
made the KM mechanism plausible) but could not serve as a test of the KM mech- 
anism. (More precisely, a value of |ex| >> 107? would have invalidated the KM 
mechanism, but any value |ex| < 107? was acceptable.) It is only the combination 
of the new measurement of ayx, with ex that provides the first precision test of the 
KM mechanism. 


e Within the SM, the smallness of £x is not related to suppression of CP violation 
but rather to suppression of flavor violation. Specifically, it is the smallness of the 
ratio |(ViaVis) /(VuaVus)| ~ A* that explains |ex| ~ 1072. 


» Until recently, the measured value of € provided a unique type of information on 
the CKM phase. For example, the measurement of sign(Re £g) > 0 tells us that 
n > 0 and excludes the lower half of the p — plane. Such information cannot be 
obtained from any CP conserving observable. 
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« The €x constraint gives hyperbolae in the p — 5 plane. It is shown in fig. 1. The 
measured value is consistent with all other CKM-related measurements and further 
narrows the allowed region. 


e The main sources of uncertainty are in the Bx parameter, Be = 0.85 + 0.15, and 
in the |V;|1 dependence. 


e £x is an extremely powerful probe of new physics. Its small value poses a problem to 
any model of new physics where the flavor suppression is less efficient than the GIM 
mechanism [54] of the SM. For example, the construction of viable supersymmetric 
models is highly constrained by the requirement that they do not give contributions 
that are orders of magnitude higher than the experimental value. 


4.1.2 The &'. parameter in the Standard Model 


Direct CP violation in K — aa decays was first measured in 1988 [3]. Two recent 
measurements achieved impressive accuracy: 


(112) 


e eM KTeV [6], 
E 


~ | (05.32 2.6) x 1074 NA48 [7]. 
In combination with previous results [4, 5], the present world average has an accuracy of 
order 10% [see Equation (2)]. 
A convenient approximate expression for £% is given by: 
"n i | Ap 
K V2 Ao 


e'82~) sin($, — do). (113) 


Note that: 


» The approximations used in (113) are |g/p| = 1 and |42/Ao| < 1. 


» The phase of &4 is determined by hadronic parameters and is, therefore, model 
independent: arg(£,) = 1/2+ ó9 — ô = 1/4. The fact that, accidentally, arg(ex) ~ 
arge), means that 

Re(e'/&) zx e'/e. (114) 


e Ree # 0 requires 5) — ôo # 0, consistent with our statement that it is a manifes- 
tation of CP violation in decay. ej, # 0 requires $2 — do # 0, consistent with our 
statement that it is a manifestation of direct CP violation. 


The calculation of e’/e within the Standard Model suffers from large hadronic uncertain- 
ties. A very naive order of magnitude estimate gives &'/e ~ (Ap/Ao)(Ap™e" / Atree) ~ 
10-3. Note that &'/z is not small because of small CP violating parameters but because 
of hadronic parameters. 


The value of the phase 8x cancels in the ratio e'/e and therefore did not affect our 
estimate. In actual calculations, one usually uses the experimental value of £4; and the 
theoretical expression for £. Then the expression for <’/e depends on the CP violating 
phase. 
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The detailed calculation of e'/e is complicated. There are several comparable contri- 
butions with differing signs. The final result can be written in the form (for details and 
references, see [53]): 


e 


z= Im(ViaV,*) [p/m — pem] 
110MeV V f AQ. 
13 Im(V,,V,*) | —— —— — MS . 
m(Va D (eS) E 


(20 Qo 4p (__™ ) 
[s (1 — nay) — 0.488 (wo (115) 


2 


We have omitted here a phase factor using the approximation arg(éx) = arg(e‘). 
PO/2, which is dominated by QCD penguins, gives the contributions from AJ = 1/2 
transitions, while P(/2, which is dominated by electroweak penguins, gives the con- 
tributions from AI = 3/2 transitions. The BYP) and BE?) factors parameterise the 
corresponding hadronic matrix elements. The QCD penguin contributions are suppressed 
by isospin breaking effects (m, 4 ma), parameterised by (2,4,;. The resulting estimates 
vary in the range [53]: 

Re(e'/c)?M = (0.5 — 4) x 1073. (116) 


We would like to emphasise the following points: 


e Direct CP violation was discovered through the measurement of e'. 


e The SM range (116) is consistent with the experimental result (2). 


e The main sources of uncertainties lie then in the parameters mag, Bp». pe. 


Qnty and AQ. 'The importance of these uncertainties is increased because of the 
cancellation between the two contributions in (115). 


The large hadronic uncertainties make it difficult to use the experimental value of 
e'/e to constrain the CKM parameters. Still, a negative value or a value much 
smaller than 1074 would have been very puzzling in the context of the SM. 


» The experimental result is useful in probing and constraining new physics. 


4.2 CP violation in K > mvp 


Observing CP violation in the rare K — mvp decays would be experimentally very chal- 
lenging and theoretically very rewarding. It is very different from the CP violation that 
has been observed in K — a7 decays which is small and involves theoretical uncertainties. 
Similar to the CP asymmetry in B > Ks, it is predicted to be large and can be cleanly 
interpreted. Furthermore, observation of the K; — n'vo decay at the rate predicted by 
the Standard Model will provide further evidence that CP violation cannot be attributed 
to mixing (AS — 2) processes only, as in superweak models. 


Define 


Ardvi = (r°vo|H|K°), Asoup = (veni, Ave = B 
K 
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The ratio between the neutral K decay rates is then 


T(K, — n'vp) _ 1+ [A«ve ? — 2ReA;,, 


T(Ks  m?vb) — 1+ uel? + Rego (118) 


We learn that the K; — n?v? decay rate vanishes in the CP limit (A,,; = 1), as expected 
on general grounds [55]. (The CP conserving contributions were explicitly calculated 
within the Standard Model [56] and within its extensions with massive neutrinos [57] and 
with extra scalars [58] and found to be negligible.) 

CP violation in decay and in mixing are expected to be negligibly small, of order 
10-5 and 107-3, respectively. Consequently, Arvo is, to an excellent approximation, a pure 
phase. Defining 204 to be the relative phase between the K—K mixing amplitude and 
twice the s — dvi decay amplitude, namely Arup = e?*9*, we get from (118): 


T(K, > mvo) 


— tan? 
[(Ks > a9vi) tan’ x. (119) 


Using the isospin relation A(K° — n°vv)/A(K+ — mtv) = 1/2, we get 


_ T(K > n?vi) 9 
Qrup = T(K* 3 mp) sin’ Ox. (120) 


The present experimental searches give 


B(K* —^m*vv) = (1.5735) x 107” [59], 
B(K, > nv) < 59x 1077 [60]. (121) 


Equation (120) implies that arvo < 1. This inequality is based on isospin considera- 
tions only. Consequently a measurement of [(A+ — atvi) can be used to set a model 
independent upper limit on T(K; — 7°v7) [61]: 


B(K, > vi) < 44 B(K* a atv). (122) 


From the range given in (121) for the K+ decay, the isospin bound on the K; decay is 
B(K, > n?vv) < 2.6 x 107°, which is more than two orders of magnitude below the 
direct bound. 

Within the Standard Model, the K — «v» decays are dominated by short distance 
Z-penguins and box diagrams and can be expressed in terms of p and 7 (see [53] for 
details and references) 


B(K* + typ) = 411x107" [X (zs) A* [n? + (po — 9]. 


B(K, > rv) = 1.80 x 107 [X (r) Atn. (123) 
P(X , a 
Here pp = 1+ AEX (zy and X(z;,) and P(X) represent the electroweak loop contributions 
Xi 


in NLO for the top quark and for the charm quark, respectively. 


We would like to emphasise the following points: 
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« The K — rmvb decays are theoretically clean. The main theoretical uncertainty in 
the K* decay is related to the strong dependence of the charm contribution on the 
renormalisation scale and the QCD scale, Py(X) = 0.42 + 0.06. The K; decay has 
hadronic uncertainties smaller than a percent. 


e In the future, these decays will provide excellent p — 7 constraints. 
» Present constraints on the CKM parameters give the SM predictions [47]: 


B(K*—m*vp) = (7.01.9) x 10", 
B(K, on vv) = (2.91.1) x 107". (124) 


The experimental range for the K+ decay (121) is then consistent with the SM but 
not yet accurate enough to constrain it, while the experimental bound on the K; 
decay is still four orders of magnitude above the SM range. 


« The CP violations in K > mv? and in B > v Ks are strongly correlated and can 
provide the most stringent test of the Kobayashi-Maskawa mechanism. 


e The K — vr decays are interesting probes of CP violation related to new physics. 


5 D decays 


Within the Standard Model, D— D mixing is expected to be well below the experimental 
bound. Furthermore, effects related to CP violation in D—D mixing are expected to be 
negligibly small since this mixing is described to an excellent approximation by physics 
of the first two generations. An experimental observation of D— D mixing close to the 
present bound or, more strongly, of related CP violation, will then be evidence for New 
Physics. 


To explain how D— D mixing is searched for and how CP violation can be signalled, 
we use notation similar to that of the B system. We thus use Equation (63) to define the 
two mass eigenstates | D12), Equation (66) to define the average width T, Equation (67) 
to define the width and mass differences y and x, Equation (76) to define the decay 
amplitudes Ay and A; and Equation (78) to define Aj. 


5.1 D — Km and D — KK decays 


Processes that are relevant to the most sensitive measurements at present are the doubly- 
Cabibbo-suppressed D— K * 1- decay, the singly-Cabibbo-suppressed D?— K + K- decay, 
the Cabibbo-favored D0— K ^ 1* decay, and the three CP-conjugate decay processes. We 
follow here the analysis presented in reference [62]. We write down approximate expres- 
sions for the time-dependent decay rates that are valid for times t < 1/T'. We take into 
account the experimental information that x, y and tan@, are small. In particular, the 
smallness of tan 8, implies that 


Akl l Ak- <1. (125) 
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We expand each of the rates only to the order that is relevant to present measurements: 
T(D*(t) 3 K*n7] = eM Agen l'la/pl^ 
x (Due IH Reve, Du + Ims, a] reete?) o? 
T(D*(t) SK ^2] = e" Aga Ip/a? 
x [Duces PHReCue-s y+ A-ne) Pe (ee? Ft) 126) 


Tr[D*(t)9K* K ] = e IA x-? {1 + [Ree x-)y 7 Imc x-)z]Ft) , 

T[D* (t) K* K ] = eT Arrr- P {1 + [RO A )y — Im )a]rt) , (127) 
TID (t) —K rt] = e^" Ag-,«|?, 

[DKr] = € Ak, l*. (128) 


Within the Standard Model, the physics of D? — D9 mixing and of the tree level decays 
is dominated by the first two generations and, consequently, CP violation can be safely 
neglected. In almost all ‘reasonable’ extensions of the SM, the six decay modes of Equa- 
tions (126), (127) and (128) are still dominated by the SM CP conserving contributions 
[63, 64]. On the other hand, there could be new short distance, possibly CP violating, 
contributions to the mixing amplitude M2. Allowing for only such effects of new physics, 
the picture of CP violation is simplified since there is no direct CP violation. The effects 
of indirect CP violation can be parameterised in the following way [65]: 


la/p| = Rm, 
New = VR R; US+6D) 
ÀK- = VR Res (079p), (129) 
Akik- = —Rm en, 


Here R and Rm are real and positive dimensionless numbers. CP violation in mixing 
is related to Rm Z 1 while CP violation in the interference of decays with and without 
mixing is related to sin óp # 0. The choice of phases and signs in (129) is consistent with 
having ¢p = 0 in the SM and ô = 0 in the SU(3) limit. We further define 


x = rcosó-rysinó, 


y 


130 
^ = ycosó — xsind. (130) 


With our assumption that there is no direct CP violation in the processes that we 
study, and using the parameterisations (129) and (67), we can rewrite Equations (126), 
(127) and (128) as follows: 


TD (tH >Ktr] = e""lAg-,« p? 
Ro. 
x I» VR cos @p—z' sin enr egt eere ; 
T[D( (t)—2K rt] = e''|Ak-,. P? 


-2 
x e RR; (y' cos dp+z' sin óp)Tt- re (y! es?) mylasn 
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rD (t) Kt K^) = e "A k- P [1 — Rm(y cos p — z sin óp)T't], 
T[D (t) 3K^K ] = e"'|Ak«k-[? [1 — R} (y cos óp + z sin óp)Tt] ; (132) 
r[D'(t) 29K rt] = T(D* (t)  K*a]- e" "Ak-, p. (133) 


Of particular interest is the linear term in Equation (131) which is potentially CP 
violating [66, 67]. It is useful to define a CP violating quantity ap, x; which depends on 
the six measurable coefficients in (131): 


Re(Ax-a+)y + Im(Ak-5*)m — Re(Akl., .)y + Im(Al,)v 


a = 
D+Kn 2D k-|vz? + y? Nie, IV? F y? 
z' 
= —-sin dp. 134 
[x2 + y? $p ( ) 


Observing apax # 0 would be the most convincing evidence for new physics in D- D 
mixing. 

The CLEO measurement [68] gives the coefficient of each of the three terms [1, Ft and 
(Ft)?] in the doubly-Cabibbo suppressed decays (131). Such measurements allow a fit to 
the parameters R, Rm, z'sin $, y' cos, and z? + y?. Fit A of reference [68] quotes the 
following one sigma ranges: 


R = (0482 0.13) x 1072, 
y'cosóp = (-2.5*1$) x 107, 
z' = (0041.5) x 1077, (135) 
Am = 0.237095. 
sinóp = 0.0406. 


It is assumed here that Rm is not very different from one and can be parameterised by a 
small parameter Aj, 
Re? = 1+ Am. (136) 


We would like to make two further comments in this regard: 


e The experimental results in Equation (135) do not show any signal of CP violation, 
that is, both sin $5 and Am are consistent with zero. Consequently, there is no hint 
of new physics in the present results. 


e To test models of new physics, it would be useful to know the value of the strong 
phase ô. Such an estimate is a difficult theoretical task [69, 70, 71] but experimental 
data on related channels would be useful [72, 73]. 


For the singly-Cabibbo suppressed modes of Equation (132), several experiments fit 
the time dependent decay rates to pure exponentials. We define F to be the parameter 
that is extracted in this way. More explicitly, for a time dependent decay rate with 
T[D(t) > f] x e-'*(1 — zPt o ---), where |z| «& 1, we have '(D f) 2 T(1-4- z). The 
above equations imply the following relations: 

['(D° > K*^K-) T [1+ Rm(ycos 6p — xsin óp)], 
f(D 5 K*K-) P [1 + Rl (ycos óp + zsin$p)], (137) 
f 


(D > K-r*) = ÎD > Kr) =T. 


t 


I 
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Note that deviations of (D > K* K^) from T do not require that y # 0. They can be 
accounted for by x # 0 and sinóp Æ 0, but then they have a different sign in the D? 
and D? decays. Combining the two D > K+K- modes, one obtains the CP conserving 
quantity ycp: 

f(DoK*K-) 
['(D° > K-2*) 


Am. 
yop = —1l= gycosóp — “> vsin $, (138) 


where we made the approximations of zero production asymmetry and small Am [62]. 
The one sigma ranges measured by various experiments are given by 


(3.4 + 1.6) x 107? FOCUS [74] 
(0.8+ 3.1) x 107? E791 [75] 


yer =) (1.1429) x 10-2 CLEO [76] (139) 
(0.5 + 1.3) x 107? BELLE [77] 
giving a world average of 
yop = (1.3 + 0.9) x 107°. (140) 


Finally, we note that direct CP violation has been searched for in the Cabibbo-favored 
[78], singly-Cabibbo-suppressed [79, 80, 81] and doubly-Cabibbo-suppressed [68] decays 
with all results consistent with zero. 

We conclude that at present there is no evidence for mixing and certainly not for CP 
violation in the neutral D system. These results are consistent with the SM and constrain 
models of new physics. If evidence is found in the future, the D —^ Kr and D> KK 
decays will provide rich enough information that we will be able to point out the origin 
of the signals in much detail. 


6 B decays 


6.1 CP violation in mixing 


CP violation in mixing is related to a non-zero value for the following quantity [see (74)]: 


q 1 GE) 
1 ~ -Im . 141 
B 2 Mi» ( ) 


The effect can be isolated by measuring the asymmetry in semileptonic decays [see (87)]: 
as, — 2(1 — la/pl) x Im(l;5/M13). (142) 
This has been searched for in several experiments, with sensitivity at the level of 107?: 


(L42-42)x10-? CLEO [82] 

_ J} (0445.7) x10? OPAL [83] 
75. =) (_1.942.8) x 107? ALEPH [84] 
(0.48 + 1.85) x 1072? BABAR [85] 


(143) 


giving a world average of 
as, = (0.2 + 1.4) x 107°. (144) 
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As explained above, in the Bg system we expect, in a model independent way, that 
[Ui2/Mig| «& 1. Within any given model we can actually calculate the two quantities from 
quark diagrams. Within the SM, Mj» is given by box diagrams. For both the By and B, 
systems, the long distance contributions are expected to be negligible and the calculation 
of these diagrams with a high loop momentum is a very good approximation. Lj» is 
calculated from a cut of box diagrams [86]. Since the cut of a diagram always involves 
on-shell particles and thus long distance physics, the calculation is, at best, a reasonable 
approximation to L'i3. (For l'i2(B,) it has been shown that local quark-hadron duality 
holds exactly in the simultaneous limit of small velocity and large number of colors. We 
thus expect an uncertainty of O(1/Nc) ~ 30% [87, 88]. For l'j2(B4) the small velocity 
limit is not as good an approximation but an uncertainty of order 5096 still seems a 
reasonable estimate [89].) 


Within the Standard Model, Mi; is dominated by top-mediated box diagrams (see 
[53] for details and references): 


2, 


M 
2 = Igy 


—LÉmgminsBsf2(Vy Via)? So(z:), (145) 
where So(z,) is a kinematic factor, ng is a QCD correction, and Bp ff parameterises the 
hadronic matrix element. For l5, we have [91, 92, 93] 


T = -CÈ mn m?Be S (Va Via) 
12 — An B! DBJ B 
5 m3 Bs m? V. nw 

x 2K,- K3)48(Ki4-K5)—5 -2 9| (146 

3 (my-m4)? Ba T 1+Ka)+8(Ki + Ko) 5 mê Va Vix (146) 


(Ka—-Ki)+3 
where Kı = —0.39 and K = 1.25 [93] are combinations of Wilson coefficients and Bs 
parameterises the (S — P)? matrix element. New physics usually takes place at a high 
energy scale and is relevant to the short distance part only. Therefore, the SM estimate in 
Equation (146) remains valid model independently. Combining (145) and (146), we learn 
that [Py2/Mi2| = O(m?/m2), which confirms our model independent order of magnitude 
estimate, |[}2/Mj2| € 1077. For the imaginary part of this ratio, we have 


Ii? -3 7) 
ası = In— x —1.4 x 10 7—— ———5. 
SL "Ma (1 —p)? +7? 
The suppression by a factor of O(10) of ası compared to |T 12/Mı2| comes from the fact 
that the leading contribution to li has the same phase as Mj. Consequently we have 
as, = O(m/m2). The CKM factor does not give any further significant suppression, 


(147) 


* 
Im put = O(1). In contrast, for the B, system, where the same expression holds 
tb “td 


except that V;4/Vi is replaced by V.5/Vis, there is an additional CKM suppression from 
Va Va \ 2 
im (ai) = on. 

In the SM and in most of its reasonable extensions, both T3 and b > ccs transitions 
are dominated by SM tree level decays. Consequently, new physics affects ag, and ayx, 
only through its contributions to Mis. This leads to interesting correlations between as, 
and ayx, that can be used to probe flavor parameters [94, 95]. Conversely, one can use 
the measured value of ax, to give model independent predictions for ası, [96, 97]. 
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6.2 Penguin pollution 


In purely hadronic B decays, CP violation in decay and in the interference of decays with 
and without mixing is > O(107?). We can therefore safely neglect CP violation in mixing 
in the following discussion and use 

q Vis Vea 


= WB. 
P Vo Vä 


(148) 


(From here on we omit the convention-dependent quark phases w, defined in Equa- 
tion (79). Our final expressions for physical quantities are of course unaffected by such 
omission.) 


À crucial aspect of our discussion is the number of relevant weak phases for a given 
decay process: 


« If there is a single weak phase that dominates the decay, CP violation in decay will 
be small and difficult to observe. On the other hand, CP asymmetries in neutral 
B decays into final CP eigenstates are subject to clean theoretical interpretation: 
we will either have precise measurements of CKM parameters or be provided with 
unambiguous evidence for new physics. 


If there are two (or more) weak phases that contribute comparably, hadronic uncer- 
tainties will appear in the theoretical interpretation of CP violation in the interfer- 
ence of decays with and without mixing. On the other hand, if there are also large 
strong phase differences, CP violation in decay can be observed in the corresponding 
charged and neutral B decays. 


In many cases of interest, different weak phases are carried by tree and penguin con- 
tributions. The difficulties arising from hadronic uncertainties related to comparable tree 
and penguin contributions have become known as "penguin pollution." 


To illustrate the problem, we will consider two relevant CP asymmetries. First, the CP 
asymmetry in B > V Ks is an example of a case where the penguin pollution is negligibly 
small and a theoretically very clean interpretation of the experimental measurement is 
possible. Second, the CP asymmetry in B — s is an example of a case where penguin 
pollution cannot be a-priori ignored. We also list various ways in which the problem might 
be overcome. 


63 BowkKs 


The first evidence for CP violation outside K decays has been provided by the recent 
BaBar and Belle measurements of the CP asymmetry in B > Ks, 


{ 0.59+0.15 Babar [11] 
aykKs = 


0.99+0.15 Belle [12] (149) 


These results in combination with previous ones [8, 9, 10] give the world average quoted 
in Equation (3). The process B — 9 Ks is one where the penguin contribution is harmless 
and the CP asymmetry is subject to an impressively clean theoretical interpretation. 
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The decay is mediated by the quark transition b > éc&. It gets contributions from a 
tree level diagram and from penguin diagrams with intermediate u, c and t quarks. Using 
the unitarity relation (54), we can write the various contributions in terms of two CKM 
combinations: 


A(b > c3) = (Tos + P? — P) Va Vos + (Po — Pi) Va Vas: (150) 


The second term is suppressed by two factors. First, there is the ratio between penguin 
and tree contributions, 
wk — Pe PP Pi | Gs aT (o Ks" T^ sey, T"c| B?) 
PT © Tek o Tad Pe- Pt 12r  m| (ntn [br y^creryusu| B9) 


(151) 


The term in the square brackets is O(0.03) but the ratio of matrix elements may 
partially compensate for this suppression. Secondly, there is the ratio of CKM elements, 
(V, SV, /(VV..)] ~ A2. We conclude that the second term is suppressed by a factor of 
rend? X 107? and we can safely neglect Pyrs. Thus the B > yK decay is dominated by 
a single weak phase, that is, arg(V; V... ). 


Neglecting Py, means that, to a very good approximation, we have |Ayk,| = 1, 
ayks = ImAyss, (152) 
and that the experimental value of ayx,, [Equation (3)] can be cleanly interpreted in terms 
of a CP violating phase. 


A new ingredient in the analysis is the effect of K —K mixing. For decays with a single 
Ks in the final state, K—K mixing is essential because B? > K? and B? — K?, and 
interference is possible only due to K — K mixing. This adds a factor of 


p — Vis Vi * 
ONE (159 
into (A/A): - 
AyKs Va VA V VA * 
= CM 1 
Aus P5 AVIV, Aaa) n 096 
The CP-eigenvalue of the state 1s yk, = —1. Combining Equations (148) and (154), we 
find 
Va V, Va Væ \ [VV 
À B K = th "td cb "cs cd" cs ; n 
a rey = - (EvE) (o) Gn (155) 
which leads to 
GyKg = sin 28. (156) 


What we have learned above is that Equation (156) is clean of hadronic uncertainties to 
O(rPA X?) < 107?. This means that the measurement of ayx, can give the theoretically 
cleanest determination of a CKM parameter, even cleaner than the determination of 
[Vas] from K — rev. [If BR(K, — vvv) is measured, it will give a comparably clean 
determination of r.] 
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Taking into account all the constraints on the CKM parameters ezcept for the ay, 
measurements, the SM prediction is [47] 


sin 20 = 0.68 + 0.18, (157) 


consistent, with the experimental result (3). This consistency has important implications. 
In particular, 


» the Kobayashi-Maskawa mechanism has successfully passed its first precision test; 


» models of approximate CP which, by definition, predict |ayx,| « 1, are excluded. 


64 B-— mmc 
The CP asymmetry in the B — rtr mode has the form 


l- Aur 2 2T Ann 
en cos Amt + m 


Gs (t) = ^1 + [Arz]? 1+ [As]? 


sin Amt. (158) 


Recently, the BaBar collaboration presented the first constraints on this asymmetry [99]: 


2ImA 
TT = 0.0 +0.54 
T+ Dae Pos 
1 — Ass? 


The results are not yet precise enough to give useful constraints. But we discuss this 
mode to show how penguin pollution arises and how it complicates the analysis. 


The decay is mediated by the quark transition b — üud. It gets contributions from a 
tree level diagram and from penguin diagrams with intermediate u, c and ¢ quarks. Using 
the unitarity relation (55), we can write the various contributions in terms of two CKM 
combinations: 

A(b > üud) = (Tuia + Pi — Pi) Vas Vua + (Pa — Pa) Vis Via- (160) 
FI 
V*y 
VdV ud 
Vi Va| are of O(43), the second term is suppressed only by the 


The ratio between magnitudes of the second and first terms is given by rpp 


Since both |V p Va| and 
factor rp, where 


Pu Pt — P$ 
TPT cT. 4 pu pe’ 
Ti Taaa P1? — Pi 
One may make a rough estimate of |P,,/T;.| from the decay B —> Kr, which can be 
parameterised as follows: 


(161) 


A(B? + K*n ) = Ty, VAV,,  PeaVas Va. (162) 


In this case |Pk,/Tk4| = O(rÉz/37). If QCD enhances the penguin contribution to 
B — rr by a significant amount, that is rper >> A’, then B — Kr would be dominated by 
the penguin process. Let us provisionally make the following assumptions: (i) flavor SU(3) 
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symmetry in the QCD matrix elements; (ii) electroweak penguins and "color suppressed" 
processes are negligible; (iii) penguins dominate B — Kz, so Tx, may be ignored in 
BR(B? + K*-); (iv) penguins make a small enough contribution to B > az that Prr 
may be ignored in BR(B? > ntr). Then 


Pa 
Tan 


Prr 
Pkr 


Pk E 
Tar 


Vab Vud 
Vis Vo 


yy te 
(Frere T 2) l (163) 


BR(B? > v*2-) 


Recent measurements [100, 101, 102] give world averages BR(B? — atx) = (44+ 
0.9) x 10-9 and BR(B? 5 K*z-) = (17.3 + 1.5) x 107°. We thus find BR(B? > 
K*z-)/BR(B? > t*r) = 3.9 and obtain the rough estimate 


IZZ] ~ 0.2 — 0.3. (164) 


It is clear that penguin effects are unlikely to be negligible in B > 7. 
Combining Equations (148) and (160), we find 


Vai) (Vaka) [rere a]. 
Vo Va) NVabVaa) Ll Be (Vb Via) / (Vas Vua) 


A(B A v1) = (165) 


If the last factor could be approximated by unity, that is rp = 0, we would obtain 
|Ann| = 1 and 
arz = sin 2a. (166) 


This approximation is however unjustified. To get an idea of the effects of Prr # 0, we 
give the leading corrections due to a small [rper]: 


[x] = 1—2(R,/Ry)Im(rp7) sina, 
Imà 002 a (167) 
Ta = sin2a+ 2(R,/ R,) Re(rpr) cos 2asin a. 


(For a more detailed discussion, see [98].) Note that if strong phases can be neglected, 
rpr is real and |A,4| = 1 would be a good approximation. But it is not clear whether the 
strong phases are indeed small. In any case, one needs to know rpy to extract a from 
a44(t). This is the problem of penguin pollution. 


A variety of solutions to this problem have been proposed, falling roughly into two 
classes. The first type of approach is to convert the estimate given above into an actual 
measurement of |Px_|. (The list of papers on this subject is long. Early works include 
[103, 104, 105]. For a much more comprehensive list of references, see [98].) Once |Pxx| 
is known, flavor SU (3) is used to relate |Py4| to | P; |. One must then include a number 
of additional effects: 


e Electroweak penguins. The effects are calculable [106]. 


» Color suppressed and re-scattering processes. These must be bounded or estimated 
using data and some further assumptions. 


e SU(3) corrections. Some, such as fx/f,, can be included, but SU(3) corrections 
generally remain a source of irreducible uncertainty. 
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The second type of approach is to exploit the fact that the penguin contribution to Pyr 
is pure AJ = 1/2, while the tree contribution to T4, contains a piece which is AJ = 3/2. 
(This is not true of the electroweak penguins [107], but these are expected to be small.) 
Isospin symmetry allows one to form a relation among the amplitudes B? — a+ 
B? > n?r?, and Bt > ntr, 


T. 


1 
V2 


There is also a relation for the charge conjugate processes. A simple geometric construc- 
tion then allows one to disentangle the unpolluted AJ = 3 amplitudes, from which sin 2a 
may be extracted cleanly [108]. 


A(B9 —5m*) + A(B9 5 a?8?) = A(B* —m* m). (168) 


The key experimental difficulty is that one must measure accurately the flavor-tagged 
rate for B? — q?z?. Since the final state consists of only four photons, and the branching 
fraction is expected to be of O(10 9), this is very hard. It has been noted that an upper 
bound on this rate, if sufficiently strong, would also allow one to bound P,, usefully 
[109, 98, 110]. 


An alternative is to perform an isospin analysis of the process BÓ — pr — ntron?’ 
[111, 112, 113, 114]. Here one must study the time-dependent asymmetry over the entire 
Dalitz plot, probing variously the intermediate states ptx* and pr’. The advantage 
here is that the final states with two 7°’s need not be considered. On the other hand, 
thousands of cleanly reconstructed events would be needed. 


Finally, one can attempt to calculate the penguin matrix elements. Model-dependent 
analyses are not really adequate for this purpose, since the goal is the extraction of 
fundamental parameters. Precise calculations of such matrix elements from lattice QCD 
are far in the future, given the large energies of the 7's and the need for an unquenched 
treatment. Recently, a new QCD-based analysis of the B — «7 matrix elements has been 
proposed [115, 116, 117, 118]. For details, see [119]. 


7 CP violation in supersymmetry 


7.1 CP violation as a probe of new physics 


We have argued that the Standard Model picture of CP violation is rather unique and 
highly predictive. We have also stated that reasonable extensions of the Standard Model 
have a very different picture of CP violation. Experimental] results are now starting to 
decide between the various possibilities. Our discussion of CP violation in the presence 
of new physics aims to demonstrate that, indeed, models of new physics can significantly 
modify the Standard Model predictions and that measurements in the near future will 
therefore have a strong impact on the theoretical understanding of CP violation. 


To understand how the Standard Model predictions could be modified by new physics, 
we focus on CP violation in the interference between decays with and without mixing. 
As explained above, this type of CP violation may give, due to its theoretical cleanliness, 
unambiguous evidence for new physics most easily. We now list some of the questions can 
be answered when many such observables are measured. 
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» Is there new physics in B — B mixing? 


Consider ayx,, the CP asymmetry in B-4Ks. This measurement will cleanly 
determine the relative phase between the B — B mixing amplitude and the b > 
cés decay amplitude (sin 26 in the SM). The b — cés decay has Standard Model 
tree contributions and therefore is very unlikely to be significantly affected by new 
physics. On the other hand, the mixing amplitude can be easily modified by new 
physics. We parameterise such a modification by a phase 64: 


204 = arg(Miz/ Mi") (169) 
so that the the SM prediction for ayr, becomes 
QGKg = sin[2(8 + a)l. (170) 


64 # 0 would be clear evidence of new physics (see e.g. [120]). 


It is interesting to note that already now the measured value of ayx, (3), which is 
consistent with the SM range, excludes many models that require a modification of 
CP violation in B — B mixing due to new physics. Among these are various models 
of soft CP violation [121, 122] aimed at solving the strong CP problem, models of 
geometric CP violation due to extra dimensions [123], models of spontaneous CP 
violation in the left-right symmetric framework [124, 125], and several models that 
aim to solve the supersymmetric CP problems [126, 127, 128]. 


Is the new physics related to AB — 1 or AB — 2 processes, or both? 


Consider agx,, the CP asymmetry in B — óKs. This measurement will cleanly 
determine the relative phase between the B — B mixing amplitude and the b — s3s 
decay amplitude (sin 28 in the SM). The b — s8s decay has only Standard Model 
penguin contributions and therefore is sensitive to new physics. We parameterise 
the modification of the decay amplitude by a phase 04 [129]: 


Ba = arg(Agx,/A3K,)- (171) 


Then 
apks = sin[2(8 + ba + 04)]. (172) 


Comparing agx; to @yx,, that is, examining whether 84 Æ 0, will tell us if the new 
physics is related to AB = 1 or AB = 2 processes. 


Is the new physics related to the third generation, or all generations? 


Consider a,,5, the CP violating ratio of K — vv decays, defined in (120). This 
measurement will cleanly determine the relative phase between the K — K mixing 
amplitude and the s — dv? decay amplitude (of order sin? 8 in the SM). The 
experimentally measured small value of £& requires that the phase of the K — 
K mixing amplitude is not modified from the Standard Model prediction. (More 
precisely, it requires that the phase of the mixing amplitude is very close to twice 
the phase of the s + dau decay amplitude [130].) On the other hand, the decay, 
which in the SM is a loop process with small mixing angles, can be easily modified 
by new physics. Examining whether the SM correlation between arp and ayx, is 
fulfilled will give us information on the generations involved. 
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e Is the new physics related to the down sector or the up sector or both? 


Consider appn, the CP violating quantity in D + K+n* decays defined in (134). 
It depends on @p, the relative phase between the D—D mixing amplitude and the 
€ — dà8u and c — sdu decay amplitudes. Within the Standard Model, the two 
decay channels are tree level. It is unlikely that they are affected by new physics. 
On the other hand, the mixing amplitude can be easily modified by new physics. 
Examining whether apgr = 0, that is, whether óp (and/or 64)Z 0, will provide 
information on the sectors involved. 


e Are the new sources of CP violation flavor changing, flavor diagonal or both? 


Consider dy, the electric dipole moment of the neutron. We did not discuss this 
quantity so far because, unlike CP violation in meson decays, flavor changing cou- 
plings are not necessary for dy. In other words, the CP violation that induces dy 
is flavor diagonal. It does in general get contributions from flavor changing physics, 
but it could be induced by sectors that are flavor blind. Within the SM (and ignor- 
ing 6gcp), the contribution from éxm arises at the three loop level and is at least 
six orders of magnitude below the experimental bound (7). If the bound is further 
improved (or a signal observed), we may elucidate the flavor dependence. 


It is no wonder then that with such rich information, flavor and CP violation provide 
an excellent probe of new physics. We will now demonstrate this situation more concretely 
by discussing CP violation in supersymmetry. 


7.2 The supersymmetric framework 


Supersymmetry solves the fine-tuning problem of the Standard Model and has many other 
virtues. But at the same time, it leads to new problems: baryon number violation, lepton 
number violation, large flavor changing neutral current processes and large CP violation. 
The first two problems can be solved by imposing R-parity on supersymmetric models. 
There is no such simple, symmetry-related solution to the problems of flavor and CP vio- 
lation. Instead, suppression of the relevant couplings can be achieved by demanding very 
constrained structures of the soft supersymmetry breaking terms. There are two impor- 
tant questions here. First, can theories of dynamical supersymmetry breaking naturally 
induce such structures? (For an excellent review of dynamical supersymmetry breaking, 
see [131].) Second, can measurements of flavor changing and/or CP violating processes 
shed light on the structure of the soft supersymmetry breaking terms? Since the answer 
to both questions is in the affirmative, we conclude that flavor changing neutral current 
processes and, in particular, CP violating observables will provide clues to the crucial 
question of how supersymmetry breaks. 


7.2.4 CP violating parameters 


A generic supersymmetric extension of the Standard Model contains a host of new fla- 
vor and CP violating parameters. (For a review of CP violation in supersymmetry see 
[132, 133].) It is an amusing exercise to count the number of parameters [134]. The su- 
persymmetric part of the Lagrangian depends, in addition to the three gauge couplings 
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of Gsm, on the parameters of the superpotential W: 


W = D (Y? H.QuU,; + YA HaQuiD 1; + Yf H4LuE,;) + BH, Hg. (173) 
hj 
In addition, we have to add soft supersymmetry breaking terms: 
Lsot = — (4t, QuUu, + At HQ iD; + Ab HaluiE 1 + BH,He+ he.) 
- IÅ; 
— D (m$) Av Ay — 3 D (ma (AA) + hc.) . (174) 
all scalars (a)=1 


where S = Q,, Dj,U,, Li, Ej. The three Yukawa matrices Y ^ depend on 27 real and 
27 imaginary parameters. Similarly, the three A/-matrices depend on 27 real and 27 
imaginary parameters. The five m2 hermitian 3 x 3 mass-squared matrices for sfermions 
have 30 real parameters and 15 phases. The gauge and Higgs sectors depend on 


Üocp, Thay, mi), Tha), G15 92, 93) by B, mj, mi, (175) 


that is 11 real and 5 imaginary parameters. Summing over all sectors, we get 95 real and 
74 imaginary parameters. The various couplings (other than the gauge couplings) can be 
thought of as spurions that break a global symmetry, 


U(3) x U(1)eg x U(1)g > U(1)a x U(1). (176) 
The U(1)eg x U(1)n charge assignments are: 
H, H, QU QD LE 
Ulm 1 1 -1 -1 -1. (177) 
Udy, 1 1 1 1 1 
Consequently, we can remove 15 real and 30 imaginary parameters, which leaves 


1294 = Ua real physical parameters. (178) 

44 imaginary physical parameters. 
In particular, there are 43 new CP violating phases! In addition to the single Kobayashi- 
Maskawa of the SM, we can put 3 phases in Mi, Mo, p (we used the U(1)po and U(1)g to 
remove the phases from 4B* and M3, respectively) and the other 40 phases appear in the 
mixing matrices of the fermion-sfermion-gaugino couplings. (Of the 80 real parameters, 
there are 11 absolute values of the parameters in (175), 9 fermion masses, 21 sfermion 
masses, 3 CKM angles and 36 SCKM angles.) Supersymmetry provides a nice example 
to our statement that reasonable extensions of the Standard Model may have more than 
one source of CP violation. 


The requirement of consistency with experimental data provides strong constraints on 
many of these parameters. For this reason, the physics of flavor and CP violation has 
had a profound impact on supersymmetric model building. A discussion of CP violation 
in this context can hardly avoid addressing the flavor problem itself. Indeed, many of 
the supersymmetric models that we analyze below were originally aimed at solving flavor 
problems. For details on the supersymmetric flavor problem, see [135]. 


For CP violation, one can distinguish two classes of experimental constraints. First, 
bounds on nuclear and atomic electric dipole moments determine what is usually called 
the supersymmetric CP problem. Second, the physics of neutral mesons and, most impor- 
tantly, the small experimental value of £y pose the supersymmetric eg problem. In the 
next two subsections we describe the two problems. 
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7.2.2 The supersymmetric CP problem 


One aspect of supersymmetric CP violation involves effects that are flavor preserving. 
Then, for simplicity, we describe this aspect in a supersymmetric model without additional 
flavor mixings, i.e. the minimal supersymmetric standard model (MSSM) with universal 
sfermion masses and with the trilinear SUSY-breaking scalar couplings proportional to 
the corresponding Yukawa couplings. (The generalisation to the case of non-universal soft 
terms is straightforward.) In such a constrained framework, there are four new phases 
beyond the two phases of the SM (ôkm and 6ocp). One arises in the bilinear p-term of 
the superpotential (173), while the other three arise in the soft supersymmetry breaking 
parameters of (174): rh (the gaugino mass), A (the trilinear scalar coupling) and B (the 
bilinear scalar coupling). Only two combinations of the four phases are physical [136, 137]: 


ġa — arg(A*m), og = arg(mpuB"*). (179) 


In the more general case of non-universal soft terms there is one independent phase ġa, 
for each quark and lepton flavor. Moreover, complex off-diagonal entries in the sfermion 
mass-squared matrices represent additional sources of CP violation. 


The most significant effect of $4 and @g is their contribution to electric dipole moments 
(EDMs). For example, the contribution from one-loop gluino diagrams to the down quark 
EDM is given by [138, 139]: 

d “T (Al sin ġa + tan Alu] sin de) 180) 
= mai sin an sin 
d 418m A H B), ( 
where we have taken mg ~ mp ~ mz ~ m?, for left- and right-handed squark and gluino 
masses. We define, as usual, tan 8 = (H,)/(H4). Similar one-loop diagrams give rise to 
chromoelectric dipole moments. The electric and chromoelectric dipole moments of the 
light quarks (u, d, s) are the main source of dy (the EDM of the neutron), giving [140] 


100 GeV 


2 
) sinda.p x 10-2 eem, (181) 
m 


dy ~2 ( 
where, as above, m represents the overall SUSY scale. In a generic supersymmetric 
framework, we expect m = O(mz) and sinó4,s = O(1). Then the constraint (7) is 
generically violated by about two orders of magnitude. This is the Supersymmetric CP 
Problem. 


Equation (181) shows two possible ways to solve the supersymmetric CP problem: 


»« Heavy squarks: m 2 1 TeV. 


» Approximate CP: sin $4,p « 1. 


Recently, a third way has been investigated, that is cancellations between various contri- 
butions to the electric dipole moments. However, there seems to be no symmetry that can 
guarantee such a cancellation. This is in contrast to the other two mechanisms mentioned 
above that were shown to arise naturally in specific models. We therefore do not discuss 
any further this third mechanism. 
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The electric dipole moment of the electron is also a sensitive probe of flavor diagonal 
CP phases. The present experimental bound, 


|de] < 4 x 10?" ecm/141], (182) 


is also violated by about two orders of magnitude for *natural' values of supersymmetric 
parameters. A new experiment [142] has been proposed to search for the electric dipole 
moment of the muon at a level smaller by five orders of magnitude than present bounds; 
such improvement will make d, another sensitive probe of supersymmetry [143]. 


7.2.8 The supersymmetric €g problem 


The supersymmetric contribution to the £x parameter is dominated by diagrams involving 
Q and d squarks in the same loop. For m = mg ~ mq ^ mp (our results depend only 
weakly on this assumption) and focusing on the contribution from the first two squark 
families, one gets (see, for example, [144]): 


5 03 fkmk ( mk ) 3 4 4 
= Im [(64,) 21.(54) na] - 183 
EK 1622 M Amy |\Ms + mq + 55 m [( 12)LL( fo) RR (183) 
Here 
m m. 
(do) = ( e & et 
m? — m 
(ó)nm = (=a) Leur (184) 
D 


where Ké! (kK) are the mixing angles in the gluino couplings to left-handed (right- 
handed) down quarks and their scalar partners. Note that CP would be violated even if 
there were two families only [145]. Using the experimental value of €x, we get 


2.2 2 m 
(Amxek)"9Y — o, (300 GeV ? [Mo 7 Mh, | (mb, ^5, IK KAR sing, 
(am 3 3 12 4819 

KEK) m mg mb 


(185) 
where $ is the CP violating phase. In a generic supersymmetric framework, we expect 
m = O(mz), Mb p/m, p = O(1), Kg? = O(1) and sing = O(1). Then the constraint 
(185) is generically violated by about seven orders of magnitude. 


The Am x constraint on Re | (5%): (8%) gg| is about two orders of magnitude weaker. 
12 12 


One can distinguish then three interesting regions for (65) = \/(64)11 (6%) rr : 
0.003 « (62) excluded, 
0.0002 « (021) < 0.003 viable with small phases, (186) 


(02) « 0.0002 viable with O(1) phases. 


The first bound comes from the Amy constraint (assuming that the relevant phase is not 
particularly close to 7/2). The bounds here apply to squark masses of order 500 GeV and 
scale like m. There is also dependence on mg/m, which is here taken to be one. 


Equation (185) shows other possible ways of solving the supersymmetric £y problem: 
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« Heavy squarks: m >> 300 GeV; 
e Universality: dma p < mo p; 
» Alignment: |K%| « 1; 

e Approximate CP: sing < 1. 


7.2.4 A supersymmetric e'/e? 


In this section we discuss the question of whether supersymmetric contributions to e'/e 
can be dominant. A typical supersymmetric contribution to ¢’/e is given by [146] 


e 


o,(m;) ]9 /158 MeV (500 GeV 
Edd ( ) ( , (187) 


ener ) [em [tte - Ste] 


where Bg parameterises the matrix element of the relevant four-quark operator. Conse- 
quently, the supersymmetric contribution saturates ¢’/e for 
ma 
Im (fn) — 694] ~ (SE) 1 
(ODPLIODM 500 Gav (188) 


where, motivated by flavor symmetries (see below), we parameterise the suppression by 
powers of A ~ 0.2. Without proportionality, a naive guess would give 


= 58Bg | 


Ms +Ma fs 


(iuo ~ eMe us seems 
m m 
ó — n ASTE 1 
( LR)21 TANTA m ( 89) 


This is not far from the value required to account for e'/e [147]. Thus, it is certainly 
possible that supersymmetry accounts for, at least, a large part of e'/e (see, for example, 
the models of references [148]-[154]). Yet, it has been argued that such a situation is 
not generic [155]. The problem is that Equation (189) gives an overestimate of the 
supersymmetric contribution in most viable models of supersymmetry breaking that have 
appeared in the literature. We will encounter concrete examples of this statement when 
we survey the various supersymmetric flavor models. 


7.3 Supersymmetry breaking and flavor models 


Before turning to a detailed discussion, we define two scales that play an important role 
in supersymmetry: As, where the soft supersymmetry breaking terms are generated, and 
Ar, where flavor dynamics takes place. When Ar > As, it is possible that there are 
no genuinely new sources of flavor and CP violation. This leads to models with exact 
universality. When Ar < As, we do not expect, in general, that flavor and CP violation 
are limited to the Yukawa matrices. One way to suppress CP violation would be to assume 
that, similarly to the Standard Model, CP violating phases are large, but their effects are 
screened, possibly by the same physics that explains the various flavor puzzles, such as 
models with Abelian or non-Abelian horizontal symmetries. It is also possible that CP 
violating effects are suppressed because squarks are heavy. Another option, which is now 
excluded, was to assume that CP is an approximate symmetry of the full theory (namely, 
CP violating phases are all small). 
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7.3.1 Gauge mediated supersymmetry breaking 


If at some high energy scale squarks are exactly degenerate and the A terms proportional 
to the Yukawa couplings, then the contribution to £y comes from Renormalisation Group 
Evolution (RGE) and is GIM suppressed, that is 


log(As/mw)]? 
MEE (190) 


ex X Im [(V, V2] Y? | 
This contribution is negligibly small [136]. The contribution from genuinely supersym- 
metric phases (i.e. the phases in A, and p) is also negligible [156, 157]. (This does not 
necessarily mean that there is no supersymmetric effect on £p. In some small corner of 
parameter space the supersymmetric contribution from stop-chargino diagrams can give 
up to 20% of e [158, 159].) 


In Gauge Mediated Supersymmetry Breaking (GMSB) [160, 161], superpartner masses 
are generated by the SM gauge interactions. These masses are then exactly universal at 
the scale Ag at which they are generated (up to tiny high order effects associated with 
Yukawa couplings). Furthermore, A terms are suppressed by loop factors. The only 
contribution to £x is then from the running, and since Ag is low it is highly suppressed. 


These models can also readily satisfy the EDM constraints. In most models, the A 
terms and gaugino masses arise from the same supersymmetry breaking auxiliary field, 
that is, they are generated by the same SUSY and U(1)g breaking source. They therefore 
carry the same phase (up to corrections from the Standard Model Yukawa couplings), and 
ga vanishes to a very good approximation: 


(191) 


log(As/mw)]* 
167? 


ba x YAY? ¥ Jon | 


The resulting EDM is dy x, 107?! ecm. This maximum can be reached only for very large 
tan 8 ~ 60 while, for small tan B ~ 1, dy is about 5 orders of magnitude smaller. This 
range of values for dy is much below the present (~ 107% ecm) and foreseen (~ 10-?5e cm) 
experimental sensitivities (see e.g. [162]). 


The value of óg in general depends on the mechanism for generating the u term. 
However, running effects can generate an adequate B term at low scales in these models 
even if B(As) = 0. One then finds [163] 

B 

"n = A,(As) + Ma(Ag) (—0.12 + 0.171%; |") , (192) 
where M; is the SU(2) gaugino mass. Since $4 ~ 0, the resulting ¢g vanishes, again up 
to corrections involving the Standard Model Yukawa couplings [164]. 


There is therefore no CP problem in simple models of gauge mediation, even with 
phases of order one. The supersymmetric contribution to D—D mixing is similarly small 
and we expect no observable effects. For the By system, GMSB models predict then a 
large CP asymmetry in B — Ks, with small deviations (at most 20%) from the SM. 

More generally, in any supersymmetric model where there are no new flavor violating 


sources beyond the Yukawa couplings, CP violation in meson decays is hardly modified 
from the SM predictions [165]. 
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7.3.2 Gravity, anomaly and gaugino mediation 


If different moduli of string theory obtain supersymmetry breaking F terms, they would 
typically induce flavor-dependent soft terms through their tree-level couplings to Standard 
Model fields. There are however various scenarios in which the leading contribution to 
the soft terms is flavor independent. The three most intensively studied frameworks are 
dilaton dominance, anomaly mediation and gaugino mediation. 


Dilaton dominance assumes that the dilaton F term is the dominant one. Then, 
at tree level, the resulting soft masses are universal and the A terms proportional to 
the Yukawa couplings. Both universality and proportionality are, however, violated by 
string loop effects. These induce corrections to squark masses of order SX m3 i», where 
ax = [27(S + S*)]-! is the string coupling. There is no reason why these corrections 
would be flavor blind. However, RGE effects enhance the universal part of the squark 
masses by roughly a factor of five, leaving the off-diagonal entries essentially unchanged. 
The flavor suppression factor is then [166] 


2 one—ioop 


(64) ~ “2 


~ ~4x 10. 
m; T 25 * (193) 


Dilaton dominance relies on the assumption that loop corrections are small. This probably 
presents the most serious theoretical difficulty for this idea, because it is hard to see how 
non-perturbative effects, which are probably required to stabilise the dilaton, could do 
so in a region of weak coupling. In the strong coupling regime, these corrections could 
be much larger. However, this idea at least gives some plausible theoretical explanation 
for how universal masses might emerge in hidden sector models. Given that dilaton 
stabilisation might require that non-perturbative effects are important, the estimate of 
flavor suppression (193) might well turn out to be an underestimate. 


We now turn to the flavor diagonal phases that enter in various EDMs. The phase $4 
vanishes at tree-level, so that (166, 167] ó4 = O(ay/m). [The smallness of ó4 implies that 
there is a suppression of O(ox/7) ~ 107? compared to (189) and the supersymmetric 
contribution to e'/e is small.] However, g is unsuppressed, even when p, and through 
it B, are generated by Kahler potential effects through supersymmetry breaking, in which 
case B = 2m3 oH [168]. While the size of m3/. is determined from the requirement that 
the cosmological constant vanishes, its phase remains arbitrary, and in fact depends on 
the phase of the constant term that is added to the superpotential in order to cancel the 
cosmological constant. 


We conclude that the supersymmetric £y problem is solved in these models but the 
EDM problem, in general, is not. For EDM contributions to be small in these models, 
the gravitino mass had better give a small physical phase. 


Anomaly mediation (AMSB) provides another approach to solving the flavor prob- 
lems of supersymmetric theories, as well as to obtaining a predictive spectrum. In the 
presence of some truly ‘hidden’ supersymmetry breaking sector, with no couplings to the 
SM fields (apart from indirect couplings through the supergravity multiplet) the confor- 
mal anomaly of the Standard Model gives rise to soft supersymmetry breaking terms for 
the Standard Model fields [169, 170]. These terms are generated purely by gravitational 
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effects and are given by 


Bu 1 

m(n) = ———— más mapí(u)- BO) sia, A(u) = -zY(u)ma;s 5 (194) 
g(u) 2 

where 6 and y are the appropriate beta function and anomalous dimension. Thus, apart 

from the Standard Model gauge and Yukawa couplings, the soft terms only involve the 


parameter 73/2. 


In general, naturalness considerations suggest that couplings of hidden and visible 
sectors should appear in the Kahler potential, leading to soft masses for scalars already at 
tree level, and certainly by one loop. As a result, one would expect the contributions (194) 
to be irrelevant. However, in "sequestered sector models" [169], in which the visible 
sector fields and supersymmetry breaking fields live on different branes separated by some 
distance, the anomaly mediated contribution (194) could be the dominant effect. This 
leads to a predictive picture for scalar masses. Since the soft terms (194) are generated 
by the Standard Model gauge and Yukawa couplings, they are universal, up to corrections 
involving the third generation Yukawa couplings. However, the resulting slepton masses- 
squared are negative, so this model requires some modification. We will not attempt 
a complete review of this subject here. Our principal concerns are the sources of CP 
violation, and the extent to which the AMSB formulae receive corrections, leading to 
non-degeneracy of the squark masses. 


For Equation (194) to correctly give the leading order soft terms, it is necessary that all 
moduli obtain large masses before supersymmetry breaking, and that there be no Planck 
scale VEVs in the supersymmetry breaking sector [171]. A possible scenario for this to 
happen is if all moduli but the fifth dimensional radius, R, sit at an enhanced symmetry 
point, and that R obtains a large mass compared to the supersymmetry-breaking scale 
(say, by a racetrack mechanism). Even in this case, however, there is a difficulty. One 
might expect that some of the moduli have masses well below the fundamental scale. 
If there are light moduli in the bulk, there are typically one-loop contributions to scalar 
masses-squared from exchanges of bulk fields, proportional to m2 /2 / P? times a loop factor 
[169]. If these contributions are non-universal, they may easily violate the Amy and €x 
constraints [133]. 


If there are no light moduli, and if the contributions described above are adequately 
suppressed, some modification of the visible sector is required in order to generate ac- 
ceptable slepton masses. Different such solutions have been suggested. In some of these 
models, there are no new contributions to CP violation simply because there are few 
enough new parameters in the theory that they can all be chosen real by field redefini- 
tions [172, 173, 174]. Furthermore, it is possible to generate the yz term in these models 
from AMSB, so that ¢g vanishes. These models are then similar to GMSB models from 
the point of view of CP violation. 


We conclude then, that in generic sequestered sector models it is difficult to obtain 
strong degeneracy and a special phase structure is required. It is conceivable that there 
might be theories with a high degree of degeneracy, or with no new sources of CP violation. 
In such theories, the SM predictions for CP violation are approximately maintained. 


Gaugino mediation [175, 176] is in many ways similar to anomaly mediation, and 
poses similar issues. These models also suppress dangerous tree level contact terms by 
invoking extra dimensions, with the Standard Model matter fields localised on one brane 
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and the supersymmetry breaking sector on another brane. In this case, however, the 
Standard Model gauge fields are in the bulk, so gauginos get masses at tree level, and as 
a result scalar masses are generated by running. Scalar masses are therefore universal. 
Furthermore, the soft terms typically involve only one new parameter, namely, the singlet 
F VEV that gives rise to gaugino masses. Therefore, they do not induce any new CP 
violation. 


Again, however, if there are non-universal tree and one loop contributions to scalar 
masses, significant violations of degeneracy and proportionality can be expected, and a 
special structure of CP violating phases is required. 


7.3.8 Supersymmetric flavor models 


Various frameworks have been suggested in which flavor symmetries, designed to explain 
the hierarchy of the Yukawa couplings, impose at the same time a special flavor structure 
on the soft supersymmetry breaking terms that helps to alleviate the flavor and CP 
problems. 


In the framework of alignment, one does not assume any squark degeneracy. In- 
stead, flavor violation is suppressed because the squark mass matrices are approximately 
diagonal in the quark mass basis. This is the case in models of Abelian flavor symme- 
tries, in which the off-diagonal entries in both the quark mass matrices and in the squark 
mass matrices are suppressed by some power of a small parameter, A, that quantifies the 
breaking of some Abelian flavor symmetry. A natural choice for the value of A is sin 6c, 
so we will take A ~ 0.2. One would naively expect the first two generation squark mixing 
to be of the order of A. However, the Amx constraint is not satisfied with the ‘naive 
alignment’, K¢, ~ A, and one has to construct more complicated models to achieve the 
required suppression [177, 178]. One can solve the supersymmetric £g problem by flavor 
suppression, that is, models with (6%) ~ A? [179]. These models are highly constrained 
and almost unique. It is simpler to construct models where (8%) ~ A? but the CP vi- 
olating phases are also suppressed [128]. Such models predict that aj, < 1 and are 
therefore now excluded. (Models with (61) ~ A? could still be viable with phases of order 
one if the RGE contributions enhance squark degeneracy.) 


For flavor diagonal phases, the question is more model dependent. There is however 
a way to suppress these phases without assuming approximate CP [179]. The mecha- 
nism requires that CP is spontaneously broken by the same fields that break the flavor 
symmetry (“flavons”). It is based on the observation that a Yukawa coupling and the 
corresponding A term carry the same horizontal charge and therefore their dependence 
on the flavon fields is similar. In particular, if a single flavon dominates a certain coupling, 
the CP phase is the same for the Yukawa coupling and for the corresponding A term and 
the resulting $4 vanishes. Similarly, if the » term and the B term depend on one (and 
the same) flavon, ¢g is suppressed. 

For &'/e, the €x constraint requires that the relevant terms are suppressed by at least 
a factor of A? compared to (189) [155] and the contribution is therefore small. 

We conclude that one can construct models in which an Abelian horizontal symmetry 
solves both the £x and the dy problems. These models are however not the generic ones 
in this framework. 
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Non-Abelian horizontal symmetries can induce approximate degeneracy between 
the first two squark generations, thus relaxing the flavor and CP problems [180]. A review 
of £x in this class of models can be found in [132]. Quite generically, the supersymmetric 
contributions to £ are too large and require small phases (see, for example, the models 
of reference (181]). There are however specific models where the £x problem is solved 
without the need for small phases [182, 183]. Furthermore, universal contributions from 
RGE running might further relax the problem. 


For flavor diagonal phases, it is difficult (though not entirely impossible) to avoid 
ba Z à? ~ 0.04 [132]. This, however, might be just enough to satisfy the dy constraint. 


With a horizontal U(2) symmetry, the two contributions to &'/e in (187) cancel each 
other. (More generally, this happens for a symmetric A matrix with Aj, = 0 [184].) 


We conclude that, similar to models of Abelian flavor symmetries, one can construct 
models of non-Abelian symmetries in which the symmetry solves both the £g and the dy 
problems. These models are however not the generic ones in this framework. 


Finally, one can construct models of heavy first two generation squarks. Here, 
the basic mechanism to suppress flavor changing processes is actually flavor diagonal so 
that mg, ~ 20 TeV. Naturalness does not allow higher masses, but this mass scale is 
not enough to satisfy even the Amy constraint [185], and one has to invoke alignment, 
KL, A. This is still not enough to satisfy the £x constraint of Equation (185), and a 
somewhat small phase is required. 


Three more comments are in order: First, in this framework, gauginos are significantly 
lighter than the first two generation squarks, and so RGE cannot induce degeneracy. 
Second, the large mass of the squarks is enough to solve the EDM related problems, 
and so it is only the £& constraint that motivates a special phase structure. Finally, 


the contribution to £'/e is negligibly small. Instead of (189), a more likely estimate 


is [155] (fp) ~ mz(Ma)ij which suppresses the relevant matrix elements by a factor 
LR)ij 


(10 TeV) 
of order 104. 


7.4 (S)Conclusions 


We would like to emphasise the following points: 


e For supersymmetry to be established, a direct observation of supersymmetric parti- 
cles is necessary. Once it is discovered, then measurements of CP violating observ- 
ables will be a very sensitive probe of its flavor structure and, consequently, of the 
mechanism of dynamical supersymmetry breaking. 


e It seems possible to distinguish between models of exact universality and models 
with genuine supersymmetric flavor and CP violation. The former tend to give 
dy €, 1077! ecm while the latter usually predict dy > 107% ecm. 


» The proximity of ayx, to the SM predictions is obviously consistent with models 
of exact universality. It disfavors models of heavy squarks such as that of refer- 
ence [185]. Models of flavor symmetries allow deviations of order 20% (or smaller) 
from the SM predictions. To be convincingly signalled, an improvement in the 
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theoretical calculations that lead to the SM predictions for ayx, will be required 
[186]. 


e Alternatively, the fact that K — avi decays are not affected by most supersyrn- 


metric flavor models [187, 188, 189] is an advantage here. The Standard Model 
correlation between a4,5 and ay, is a much cleaner test than a comparison of a, x, 
to the CKM constraints. 


e The neutral D system provides a stringent test of alignment. Observation of CP 


violation in the D — Ka decays will make a convincing case for new physics. 
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1 Introduction 


The brief for these lectures is to introduce some of the ideas and challenges for super- 
symmetry coming from flavour and CP physics, aimed at an audience that includes ‘non- 
experts'. It is usual for courses on this subject to jump in at the superpotential for the 
Minimal Supersymmetric Standard Model, and leave the reader to sort out for him or 
herself what the superpotential actually means. In this course I have instead decided to 
begin at square one, assuming no prior knowledge other than field theory of scalars and 
fermions. While it might seem a challenge to get from this point to the MSSM in a just 
page or two of working, this is in fact possible (and only slightly painful) if we do not 
spend too long worrying about things like 'superspace', which although quite fundamen- 
tal, do not really have much bearing on the question of phenomenology. Thus the first 
section will develop the idea of a superpotential by giving a quick and dirty demonstration 
of how to ‘supersymmetrise’ a fundamental scalar. By following the (very explicit) steps 
in this part, those readers who are unfamiliar with supersymmetry can get a grasp of the 
various concepts involved. In particular we will see at first hand the famous cancellation 
of divergences that makes supersymmetry such an important idea. This will lead natu- 
rally to the MSSM superpotential and from there we can go on to analyse the particular 
problems that flavour and CP throw up for supersymmetry. 


The interests of a SUSSP audience are typically rather broad, so the course is organised 
as follows. The reader who requires a brief overview of the subject will be able to follow 
the main body of the text which is self contained. However, since some of the audience 
may want to develop a deeper knowledge of the subject, I have made copious use of 
appendices to include more detailed discussions of various aspects as we go along. From 
these it should be possible for the reader to follow some of the more formal texts in [1]. 
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Why supersymmetry? 


Although there are several responses to this question, the most important is undoubtedly 
cancellation of divergences. This is a remarkable feature of supersymmetric models that 
gives stability to any theory, specifically stability of mass scales. We shall shortly derive 
it in detail, but in brief it goes as follows. 


Consider a theory involving scalars with self coupling às, the Higgs particles, and also 
fermions which couple to the Higgs particles in Yukawa couplings with strength Aș. 


Figure 1. Contributions to the Higgs potential. 


At one loop order, the Higgs potential will get a divergent contribution from the 
diagrams shown in Figure 1, which is of the form 


5Vitiggs ~ (Au^ — A5)? AP. (1) 


The fermionic loop gets an extra minus sign. This type of contribution is a disaster 
for electroweak symmetry breaking, since we will have to tune our 4, and A, couplings 
very precisely (to at least one part in 10!* if we take A > Maur ~ 10!9GeV) if the 
physical Higgs mass and hence weak gauge bosons are to end up around the 100GeV 
mark. Moreover, the À couplings will themselves have logarithmic radiative corrections 
from one-loop diagrams, and in fact we need to re-tune at each order in perturbation 
theory. Finally the low energy theory will be sensitive to unknown high energy physics - 
for example an unknown GUT theory or string theory. The problem introduced by the 
existence of vastly different mass scales is referred to as the gauge-hierarchy problem. If 
you believe that the Higgs fields are the result of some non-perturbative process (a top 
quark condensate perhaps) happening just above the weak scale, or that there are no new 
scales above Mw, or more generally, that there are no fundamental scalars, then this will 
not be a problem for you. 


Supersymmetry solves these technical problems by ensuring that A; = Ap, and in fact 
all contributions to the effective potential then vanish. The only renormalisation of the 
bare couplings which takes place comes from field renormalisations. The price we have 
to pay for such a useful cancellation is high; we have to introduce a fermionic(bosonic) 
degree of freedom for every boson(fermion) (known collectively as superpartners). The 
general form of the couplings will be 

2 
Nua) ~ Ati) log ft (2) 
H2 
Now couplings which start off small at the GUT scale remain small. (This holds to all 
orders in perturbation theory.) 
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This then is the main reason for taking supersymmetry seriously. Other reasons (in 
descending order of compulsion) are that the gauge couplings of the supersymmetric 
standard model (MSSM) appear to exactly unify at some (presumably GUT) scale, or 
that the only known theory of quantum gravity (ie. string theory) strongly suggests 
it, or that the cosmological constant is zero. This last reason is probably invalid, since 
supersymmetry must be broken at some scale, reintroducing a cosmological constant 
(albeit one which is only out by 52 orders of magnitude instead of 120). One additional 
interesting hint for extra scalars comes from the fact that for relatively low mass Higgs 
particles (around 115GeV say) the Standard Model vacuum is unstable to decay, as shown 
in [2] and references therein, because the top mass is large. This instability can be removed 
if there are more scalars than just the Higgs fields. 


2 Express supersymmetry 


Our aim in this section is to derive (or at least explain the form of) supersymmetric inter- 
actions from their non-supersymmetric counterparts (SM). We begin in this subsection by 
supersymmetrising a simple Lagrangian describing a massless, free boson. We will then 
discuss how to write down interactions using the superpotential and examine the top 
quark Yukawa coupling in detail, in particular verifying the cancellation discussed above. 
Readers already familiar with supersymmetry can skip to Section 3. Readers who would 
like to see the more detailed conventional approach can consult the standard texts [1]. 


Let us begin by considering the Lagrangian for a free boson: 
L = — (3 H“ (3H). (3) 


Since we wish to describe both particles and anti-particles, ó is complex with two degrees 
of freedom. Also $ has spin 0 and dimension 1. This Lagrangian resembles that of 
the Higgs which is already present in the SM. Eventually, each chirality (left or right) 
of matter fields will aquire a scalar superpartner of this type, together forming chiral 
supermultiplets. 


Now let us introduce our fermion superpartner which must also have two degrees 
of freedom for a sensible transformation. Before continuing the reader should familiarise 
themselves with the Weyl notation for fermions which has become standard for this subject 
and is outlined in Appendix A. The supersymmetry transformation should by definition 
turn the scalars into fermions so, noting that a two component spinor has the same 
number of degrees of freedom as the scalar, let us introduce an infinitesimal (linear) 
supersymmetry transformation ôg on the complex boson above, involving such a spinor: 


be = a£? X,. (4) 


The fermionic parameter £^ describes the magnitude of the transformation, and a is some 
as yet undetermined constant. We will often drop the fermionic indices œ when they are 
contracted. The spin and dimension on both sides must match, so that £^ must have spin 
1/2 (it anticommutes with itself and X) and dimension 1/2. Next we wish to have an 
equivalent transformation for the spinor itself. Matching spin and dimensions again, this 
can be of the form, 


5Xq = bo" EO, + Cok. (5) 
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where F is some field of spin 0 and dimension 2 whose existence will be explained shortly. 
Then for F we can have; 


ôF = d£,0' ^0, x,. (6) 
In order to determine the constants a,b, c, d we need to consider the commutation of two 
transformations. For example, 


[5¢, 6,]ó = ab(no" X — Xo"7)0, o. (T) 


This commutator involves only one field on each side, and is therefore related to the 
underlying symmetry (supersymmetry). It describes a relation between the dg and ô, 
operators, and should not depend on which field (¢, X or F) we choose to express it. (The 
equivalent statement in gauge theory, is that the Lie-algebra [T^, T°] = ifas-T is formed 
by successive gauge transformations on any of the fields in the theory, irrespective of their 
representation.) We need to ensure that the algebra is the same for the field X, and this 
is why the field F is required; in fact 


[óc, ôn] X = cd(no#X — Xo"7)0,X, (8) 


giving ab — cd. The extra field F is known as an auziliary field since it is there simply 
to make the algebra close. It does not propagate, and so it is easily removed from the 
Lagrangian using its equation of motion. The full chiral supermultiplet consists of the 
three fields & = (9, X, F) (and the anti-chiral supermultiplet, © = {¢*,X,F*}). Having 
determined the multiplet structure, we are able to write down the simplest invariant 
Lagrangian. Using the usual normalisations, our first guess is, 


Lre = —iXo"0,X + $030 + F*F (9) 


In order for this to be invariant (i.e. to transform into itself plus total derivatives under 
the operations (4)-(6)), we find that we require 


b — ia, d — ic, (10) 


which leaves only one degree of freedom a which normalises the operators 6g. Conven- 
tionally we take a = V2. 


So far we have only the kinetic terms of our Lagrangian. In order to be able to include 
Yukawa couplings, for example, we need to consider additional interaction terms. As a 
starting point, notice that the auxiliary field F transforms into a total derivative under a 
supersymmetry transformation. Thus 


Le=F (11) 


is a suitable invariant. By itself this is of course not much use, since we also wish to 
include mass terms and gauge symmetries. In order to do this we must first generalise 
our infinitesimal transformations to make them finite; a transformation through a finite 
spinor parameter 0, is given by the operator exp dg. Arbitrary transformations through 
9, of the scalar field (x) will change its value in a one-to one manner. We can now 
construct a scalar quantity, (x, 0,0), which gives (x) for arbitrary values of ĝa. Taking 
$(x) as the starting value when ĝa = 0, 


$(z,0,0) = (z), (12) 
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we can find the general form of ® by transforming ¢; 


$(7,0,8) = ( "m jv " gv + Qj +. ..) ó(z) 
= $(y) + V20x(y) + 0F(y) (13) 


where y^ = z” + i00%9. We have used the identities 676, = 0 (because 6, has only 
two components which anticommute), and e*°99 = —20^0. These scalar functions are 
referred to as chiral-superfields. The Taylor expansion in 6s of any function terminates at 
the 69 term, and consequently any general function of y and @ is a chiral superfield, and 
its components must transform as the components of a chiral supermultiplet. Beginning 
with $* results instead in anti-chiral superfields (denoted ®) which are functions of y* 
and 8 only. 


Chiral superfields enable us to build up more general Lagrangians, once we notice that 
(trivially) the product of two or more chiral superfields is also of this form and therefore 
is also a chiral superfield. So, for example, the F-term of the product of two chiral fields 
gives mass terms, 


1 
Lin = e the. = $F — z^ ds the. (14) 


It is easy to check that this is invariant under the infinitesimal supersymmetry transfor- 
mations. For the Yukawa interactions we need the product of three superfields, 


1 
Lyuk = 39:99. 


F + h.c. = AgkFió;ók = Auk? Vjadk + h.c. (15) 


The generic indices i,j,k stand for summation over gauge representation and/or field 
labels (e.g. Higgs, matter etc.). Terms of order higher than three make the theory 
non-renormalisable and will not be considered further. The most general interaction 
Lagrangian is therefore of the form 


Lint = W($,)|,. + h.c. (16) 


where W (4) is a third order polynomial of the chiral superfields, known as the super- 
potential. In terms of the superpotential, the Lagrangian is 


OW 1 gw 

Line = FS — Dij he. 

' 0d; 7" ý 09,09; tac 

where only the scalar part of the potential is taken, and summation over indices is implied. 
Finally, by using the equations of motion of F}, 


(17) 


oW 
F = Er» (18) 
we are able to eliminate it, to give 
aw\? 1, 9W 
int = | =] zz the. 


In particular we can see that the supersymmetry potential is positive definite, and has a 
minimum at 0W/0¢@; = 0 where it vanishes. 
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So we have derived a Lagrangian which is able to describe scalars and their fermionic 
superpartners. For this introduction we have been avoiding the notion of superspace; the 
interested reader can consult Appendix B which explains the connection. Now let us 
progress to the simplest non-trivial example, the top quark Yukawa, in order to see the 
promised cancellation of one loop divergences. 


The top quark Yukawa 


To write down a top-quark Yukawa we naturally define the superfields to have the exactly 
the same charges as they would in the Standard Model: 


Thus we have 
Wtop- Yukawa = MQH ute (20) 
leading to a Lagrangian of the form 


Ltop—Yukawa = —Arghate V Aid (ute) ~ M(ghu)te + X [Rute]? + Ahi? + Ate". (21) 


It is customary to write the superpartner of any field as that field with a tilde and we 
shall follow that convention here. Now consider the one loop contributions to the h, mass 
squareds. The internal fermions can only be the pair q and t, , whereas the scalar ‘bubble’ 
diagram gets two contributions separately from d and f. Including the symmetry factor 
of 1/2 in the scalar diagram, and -1 for the fermion loop, we find exact cancellation as 
promised. 


As yet we have not included the gauge fields (we have no spin one fields), however 
we can guess what their Lagrangian might look like from the above. Supersymmetry 
makes fermions and bosons indistinguishable, so not surprisingly the net effect on the 
Yukawa coupling was to introduce duplicate 'super'-Yukawa couplings with the fermions 
and bosons permuted (the first three terms in Equation 21). The same duplication hap- 
pens in the gauge sector as well. Here the physical components of the vector supermul- 
tiplets are (A4,v4,D4] (where A is a group index) which are respectively the gauginos, 
gauge bosons (of the Standard Model) and auxiliary fields. The matter-gauge-matter 
couplings become 


Coo, VW = 9X0, vj; X; + iV 2g@F AX; + h.c. (22) 
Where we use the usual notation that vii = v47/4. As with the Yukawa, the introduction 


of supersymmetry has simply added supersymmetric counterpart couplings with the roles 
of boson and fermion reversed. 


The inclusion of gauge symmetry and the supersymmetrisation of the Yang-Mills term 
involves so-called vector supermultiplets. It is possible (although much more arduous) to 
build these supermultiplets beginning with the lowest component, as we did above for the 
chiral supermultiplets. However, for the discussion of CP and flavour phenomenology we 
now have everything we need; the reader interested in these more technical aspects can 
turn to Appendix C where a discussion of the total gauge invariant Lagrangian is included 
for completeness. 
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3 The softly broken MSSM 


We now have a general framework for building supersymmetric gauge invariant La- 
grangians. Our aim is to build a model which includes this, and yet mimics the Standard 
Model at low enough energies. A consensus seems to have been established as to the most 
likely way this might happen in Nature. This is referred to as the softly broken Minimal 
Supersymmetric Standard Model (MSSM). 


There are a number of influences guiding the form of the MSSM, one of the most 
overriding being ‘naturalness’. A case in point is the scale of supersymmetry breaking 
itself (clearly a requirement). Most would agree that this should not be too high, otherwise 
fine-tuning is re-introduced. Because of this, a ceiling of a few TeV is usually placed on 
this parameter. One should bear in mind however that there are no other upper limits 
on the scale of supersymmetry breaking. This problem is compounded by the fact that 
there is still a large number of additional parameters left in the model (supersymmetry 
breaking scalar masses for example). Often these are reduced by considering a particular 
kind of supersymmetry breaking mediation - for example mediation by supergravity (local 
supersymmetry) or gauge mediation, but also they are reduced by considering expcriment. 
itself. (For example, the absence of large FCNCs is often cited as the reason for taking 
scalar masses to be degenerate at the GUT scale.) Because of this, getting genuine 
predictions out of supersymmetry is difficult, and usually the ‘prediction’ is merely that 
supersymmetry is consistent with a large (g — 2), or electric dipole moment of the neutron 
or b — sy or whatever. In fact, usually supersymmetry is also consistent with values of 
(g — 2), etc. which totally violate current experimental bounds, thus lending an air of 
‘just around the corner’-ness to the whole business. 


This difficulty really arises out of our ignorance about the nature of supersymmetry 
breaking and leads to what has become known as the supersymmetric flavour and CP 
problems. These general problems will be illustrated in this section, and expanded upon 
later. First however, let us present the superpotential of the MSSM. 


3.1 The MSSM superpotential 


Since we wish to stay as close to the SM as possible, we assume that the same gauge groups 
and representations will appear. Thus, the group will be SU(3), x SU(2), x U(1)y. The 
gauge bosons (g, Wi» and B) are accompanied by fermionic superpartners, the gluinos, 
winos and bino. The matter and Higgs fields become chiral supermultiplets. These 
are usually represented by capital letters, so that we have Q = (dr,qu], L = {li}, 
e — (ü$, uk} and so on for each of the quarks and leptons, and H = {h, h} for the Higgs 
doubler L, Q and H are SU(2) doublets with 
h? 
(E) 0m 


e(a) eG) 


The superpartners are the squarks, sleptons and Higgsinos respectively. 


We now encounter a minor problem. In the Standard Model we were able to make do 
with only one Higgs field, to generate two masses. The Yukawa couplings whìch did this 


208 Steven Abel 


were of the form . u 
Lyuk = Aelther + Aqdz hdr + AuG hug + h.c.. (24) 


The k scalar has hypercharge Y = 1, and the h scalar is given by h = irjh* with Y = —1. 
(The 7; are the Dirac matrix generators of SU(2), acting on isospin indices, not to be 
confused with the c;.) Clearly, if we turn these fields into chiral supermultiplets, the two 
quark mass terms cannot both be derived from a superpotential since if h corresponds 
to a chiral superfield then A* corresponds to an antichiral one. An additional problem 
comes from triangle anomalies. These are a potential problem in any theory with chiral 
fermions, and their disappearance requires that 55; Y; = X; Y? = 0. In the Standard 
Model, these cancel exactly, but now this is no longer the case since we have introduced an 
extra fermion, the Higgsino, which also couples to isospin. Happily, both these problems 
are solved by the introduction of a new Higgs superfield with hypercharge —1, which 
replaces h. Our total particle content has the following assignments and transformations 
under the Standard Model gauge group, 


H? 
L 2) (1,2, -1) H, 6 (1,2, -1) 


Il 
es 
x 
m 


ur Hj 
Q = (5) (3, 2, 1/3) H, = HM (1,2,1) (25) 
E* (1,1,2) 
De (3,1,2/3) 
ue (3,1, —4/3) 


where H, and H; are two entirely separate superfields. (There is, by definition, no right 
handed neutrino in the MSSM although clearly experimental data now suggests one. 
Neutrino physics requires a review in itself and we shall not attempt to cover it here.) 
The Yukawa piece of the superpotential (taking a little more care with our isospin indices 
for a moment) may now taken to be 


Wy = ÀgE HT eL + ApD*HTeQ + AyU*HT€Q + h.c., (26) 


where we have suppressed colour and generation indices. The c' = i7; is the same matrix 
as e, the prime being there simply to remind us that its indices act on isospin. SU(2) 


gauge invariance follows from the fact that e'r? = —(r*)Te, so that (e'^")Te'e'^* 


-e. 
The extra Higgs superfield has introduced another parameter; tang = (H9)/(H9). 
The fact that this ratio can be large is seen as a possible explanation for the large value 


of the top quark mass. 


In addition to the Yukawa terms, we will require some Higgs self-couplings in order to 
generate the correct electro-weak breaking; 


WHiggs = uHTe Ho, (27) 


where u has dimensions of mass. In order to get the correct scale for breaking (i.e. the 
correct values of Mw and Mz) it turns out that we need to have u = O(My). The fact 
that we have an additional dimensional parameter (besides the scale of SUSY breaking) 
that needs to be set by hand, is known as the ‘y-problem’. 
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3.2 Soft SUSY breaking 


If supersymmetry is the correct extension of the Standard Model, then it is expected to 
be softly broken by dimensionful parameters (such as squark mass terms), in order for the 
broken supersymmetry to still remove quadratic divergences and provide a solution to the 
problem of maintaining a hierarchy between the electroweak scale and the Planck scale. 


The physics of the breaking is not yet well understood and there are several mech- 
anisms that can achieve it. To take account of our ignorance one can write a general, 
gauge invariant, Lorentz invariant, effective Lagrangian; 


L = Lsusy + Lsoft (28) 


where the first term preserves supersymmetry invariance and the second violates super- 
symmetry and is given by (dropping the c) 


-Lst = $ (M399 + MAWW + MBB + hic.) 
+ M2Q*Q + MIU" 0° + MED De + M3L*L + MBE E: 
+ AyAyUQH2 + ApApD*QH; + AgAgE* LH, + h.c. 
+ mà, Hf Hy + m}, Hf H, + (BHH, + h.c.) 


(29) 


where all generation indices have been suppressed. The most general softly broken MSSM 
has a total of a 124 physical parameters. The count goes as follows; 6 of the parame- 
ters come from gaugino mass terms of the form M; = |Mi| e'*. The squark and slepton 
masses are in principle 3x3 Hermitian matrices with complex matrix elements, contribut- 
ing 5x6x2 = 60 parameters. Trilinear couplings between the sfermions and Higgs bosons 
are arbitrary 3x3 complex matrices which constitute 2x9x2 = 36 parameters. Addi- 
tional parameters arise from the u term, resulting in 33 mass eigenstates, 43 phases and 
the CKM angle. 


3.3 Flavour and CP problems: mass insertion approximation 


Given the most general form of supersymmetry breaking in (29) we see immediately that 
there are likely to be phenomenological problems with both flavour changing and CP 
violating processes. For example, using the new supersymmetric contributions to the 
Lagrangian in equations 21 and 22, we can construct a diagram (Figure 2) with an 
internal wino and squark—one of many possible supersymmetric contributions to b — sy. 


h or W 
t; Ng 
~g or "n b ' 1 s 
q q | 


Figure 2. A supersymmetric contribution to b — sy. 
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The diagram includes a flavour changing mass insertion on the squark line and our 
lack of knowledge about supersymmetry breaking means that in principle this insertion 
could be as large as any of the other terms. Likewise, the supersymmetry breaking terms 
can have arbitrary CP violating phases and it is not difficult to construct diagrams that 
contribute to electric dipole moments (of which more later). 


The appropriate way to treat these problem generally is to diagonalise all the mass 
matrices and calculate the diagrams with the mass-eigenstates. However there is a useful 
approximation, the mass-insertion approximation, that can be used when the supersym- 
metry breaking is approximately generation independent. This type of situation is usually 
motivated by particular kinds of supersymmetry breaking mediation, for example media- 
tion by gravity resulting in mSUGRA models. In such models the canonical assumption 
is that the supersymmetry breaking terms are flavour universal at the GUT scale. Thus 
the most commonly considered scenario has 

m; = ym? 
Ma Mi 
Avy.. = Ap,, = Ag Adj; . (30) 


i ij 


1j 


Together with the parameters 4 and B we then have 5 independent parameters. |x|? can 
be determined by electroweak symmetry breaking, and traditionally we trade 4B for tan Ø, 
the ratio of Higgs VEVs, leaving us with 4 independent parameters, mo, Mi», A, tan JJ. 
In addition there are 2 physical phases 4 = arg( AM) and $, = arg(u Mr). All other 
phases can be rotated away. Typically these phases must be small in order to satisfy all 
electric dipole moment (EDM) requirements (including that of the mercury EDM [6]). 
This model, sometimes referred to as the Constrained MSSM (CMSSM), is an example 
of a Minimal Flavour Violating model (MFV model), defined as models where all flavour 
and CP violation is attributed to the Yukawa couplings. 


Typically, running the renormalisation group equation from the GUT to the elec- 
troweak scales, will result in flavour non-universality thanks to the contribution from the 
Yukawa couplings, however they will generally be small. (Note that splittings between the 
squared masses of left and right-handed scalars, and also between squarks and sleptons 
will generally be large because of their different gauge couplings.) Thus, when considering 
a particular process, it is useful to consider a situation where the squark masses matrices 
are flavour universal and real, and to find the experimental limits on perturbations. This 
approach, pioneered by Gabbiani et al in [3], allows one to state the flavour and CP prob- 
lems more quantitatively since it leads to model independent bounds on mass insertions. 
One begins by defining perturbations from universal scalar masses, m%, as follows: 


f 
2 2 2 
q (mj)un;; s, = (mtu, i, = (ma) RR; 31 
LRij m me , LLij = mè , RB = mÈ t ( ) 
f f 


It is then fairly straightforward to calculate the leading contributions to various processes 
by considering the diagrams with a single insertion. The approximation is represented 
pictorially in Figure 3. The fully diagonalised quark propagator would have diagonalisa- 
tion matrices for the masses, Vj, where 7 labels flavour and a labels one the 6 squark mass 
eigenstates, on either end. However, this is equivalent to treating the flavour changing 
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Figure 3. The insertion approximation for the quark propagator. 


as an interaction vertex (denoted X). The experimental bounds can then be expressed as 
limits on the 6’s, given in Table 1 and Table 2. For b — sy decays with mj-—500GeV the 
main contributions are give in Figure 4 and the limits in Table 3. CP violating processes 
constrain imaginary insertions. e.g. for M3 = m;—500GeV we have the limits in Table 4. 


Limits on- | Rel, Y | ARE, 8 | Re, ReGRa, 
ij — sd 004 | 0.004 0.003 
ij — bd 0.1 | 003 0.02 
ij =cu 0.1 | 903 | 0.02 


Table 1. Limits from AF = 2 with M3(Mw) = m;—500GeV. 


Limits on- | lor, | | |ó M 
ij — ue Pos [zm 
ij — re | 2 9 | 0 

ü-ra | 5 | 002 


Table 2. Limits from l; > ljy decays with M, = m;=100Ge V. 


Figure 4. Main contributions to b > sy decays with m;-—900Ge V. 
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[2 = M3/m* | (Stil | [SE 
| 03 | 44 | 0.013 

1 8 | 0.016 
: 4 26 | 0.03 


Table 3. Limits from b — sy decays with mz=400Ge V. 


€ /|[Im(54,,,)2| < 0.002 


e'fe Jm(&i, ,)| < 0.5 |I m(814,,)| < 2.10? 
| dn «61075 | [Im(5$p,,)| < 110-9 | |Im(8tg,,) < 3.107 


[Im(54p,,)2 < 0.0006 


Table 4. CP violating processes constrain imaginary insertions. Limits are for M4 — 
m;=500Ge V. 


Satisfying experimental bounds/results on flavour changing and CP violating processes 
obviously prohibits significant perturbations from universality for many of these param- 
eters (although other reasons besides universality have been considered in the literature, 
and will be mentioned later). This apparent fine tuning is the quantitative form of the 
flavour and CP problems. 


Incidentally, in the case of the mSUGRA, all of these constraints are satisfied. Note 
that, had this not been the case (i.e. if running the RGE's in mSUGRA had resulted 
in supersymmetry breaking terms that violated these bounds) there would have been 
very little hope of ever explaining electroweak phenomena (e.g. symmetry breaking and 
masses) in terms of Planck or GUT scale physics. 


4 The mass-spectrum and generic SUSY breaking 


The mass insertion approximation outlined in the previous section is useful for addressing 
the question: how far can we deviate from universality and still satisfy experimental 
bounds? However, there may be (and often are) other wildly different patterns of soft- 
supersymmetry breaking which can have a significant degree of non-universality and yet 
have cancellation of contributions to various processes. (One example is models with 
Hermitian flavour structure, in which the EDMs are suppressed). 


In order to calculate in more general scenarios, it is necessary to diagonalise all mass 
matrices and find the physical mass-eigenstates. The particle content and diagonalisations 
proceed in the following manner. First we collect the components with the same e.m. 
charges together into squarks, charginos, neutralinos: 


QLi 1.3 ® dns > Ga=1..6 


hjoW* + X 
MELLE 
hj DohoeW'eB > Xais (32) 
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The sfermion mass-squared terms are given by 
px Py age (fe 
Ly mass—squared = (fr , Fri M5 (? ) (33) 
where the M} _are 6x6 matrices 
ij 
; Mji--m3--M2(5-Q, sin’ Oy) cos 28 mj(A,y — u* Ry) 
Mi- (34) 


h.c. Mg + m3 + M2Qy sin? Ow cos28 


with R; = cot  ( or tan 8) for I; = 1/2 ( or 1/2) and tan 8 = (h2)/(h1). We diagonalise 
this mass matrix by making a unitary transformation on the squarks 


— t 
M$ bap m (Vi Mj Vj), (35) 


The charginos have a mass term 


Leharginos = (W*, hi), Mc; W- (36) 
hy 
j 
where 
M. 2My si 
Me = 2 v2 w sin B (37) 
V2My cos 8 lt 
and we require a bi-unitary transformation to diagonalise the mass matrix: 
My+-UGMcVe' (38) 
For the neutralinos we have: 
Lyeutralinos = (5°, W^. ha, hn) ; Mny B (39) 
1 iyo 
ha 
ho 
uj 
where My is symmetric: 
Mj; 0 -MzsinÜwycosB Mzsinéy sin 8 
My = T My MzcosOwcosG —Mzcoséy sin B (40) 
T T 0 —p 
T T T 0 
with 
Myo = Vai My Vn (41) 


The quantities which then enter calculations of FCNC and CP violating processes are 
then of course the diagonalisation matrices, Vz, Vc, Uc, Vn which appear on the vertices. 
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4.1 An example calculation and the SuperGIM mechanism 


All calculations follow more or less the same format, so let us examine just one in detail, 
the supersymmetric contribution to the EDMs. EDMs are quite complicated beasts, 
and get contributions from electromagnetic, chromomagnetic as well as purely gluonic 
contributions shown in Figure 5. See references |4, 5, 6]. 


to REP a " 


Figure 5. Leading SUSY contributions to the EDMs. The photon and gluon lines are to 
be attached to the loop in all possible ways. 


For simplicity I will consider only the electromagnetic contributions to the fermion 
EDMs. These are given by [5] 


d? (5 Ie. Ê m- m 
€ 888 y q(pe "Gd 
3x; 24 mU, ype 9s M2)’ 


gk 


Xt (E) m? m2 
d — Qem x) Xt 
= = I ui B J u We A( j ) ` 
e An sin? Ow pap X mT uir) aile: ( M? +Q a M 
k k 
X* (E) m? m? 
: EX Te [oo ) Gs) 
= lem Im(Is, B(X -Q3 af), 
e 4r sin? Ow SÈ ma (Taie) Mi Qs MEI (Qa = Qr) m 
X* (E) 2 " 
d, Qem my+ X) 
e ~ Aa sin? by » m Im(T«) A mij! 
X? (E) 2 
d a 6 Myo 
f _ em I c3 42 
e Ar sin? By pps ma) ME ue eB ME) (42) 
k 
where 
D =e Va Va, (43) 


with ¢3 being the gluino phase. The chargino factor I fi comes from products of vertices 
with attached charged Higgsino or wino: 


Puik = — Au Voi) a (6a V T — Uca 


dlk EE 


Paik = AU Va Ven Very — À u Véio Vzak) (44) 
and analogously for the electron. The A; are the Yukawa couplings 


Mau Mde 


àu = ~, Ade = M. 45 
V2mw sin 8 i v2my cos B (45) 
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The neutralino vertex rri, is given by 


nye = |-V2(tan6w(Qy — I5) Vui + I, Vy] V. - A Vni Vs 


x V2 tan wQ;VyiVz - AVV] , (46) 


where I; is the third component of the isospin, 6 = 3 (or 4) for J, = —1/2 (or 1/2). The 
loop functions A(r), B(r), and C(r) are defined by 


1 2Inr 
Ar) = al ne 1I), 
H 2rlnr 
B(r)- 2——45 
(r) xag trt p) 


C(r) = (47) 


1 2rinr  18lnr 
10r — 26 ). 
sel i *1-r l-r 


Note that the indices ‘1’ and ‘4’ appear in the above because for the EDMs of the down, 
up and electron we pick out the left and right chirality fermions of the first generation. 


This completes the electromagnetic contribution to the quark and electron EDMs. 
We can now see in a more general way why flavour universality gives cancellation of 
contributions. Consider for example the gluino contribution to EDMs. In the event that 
the squark masses are universal and equal to M2 this can be written as 


ds (E) —2a, m; mix £ 
e = —% 79, B(T4) Y mmi. (48) 
< q q' k=l 
The last factor in this expression is 
1ky _ - i) Ln — 
Im(Py*) = e (V3). =0, (49) 


and consequently the gluino contribution exactly vanishes because of the unitarity of the 
diagonalisation matrices, Vz. With exact universality the leading contribution is from 
the chargino diagram, however these are proportional to Im Y .,(F'ux). Again unitarity 
means that the main contribution here is from the phase on the p term (unfortunately 
the least well understood of all the parameters in the MSSM). Note that even in the 
CMSSM, unitarity is broken between left and right chiralities at the weak scale because 
of the different gauge coupling contributions to the mass-squared RGEs. Because of this 
there is also a small dependence on the phase of the A-term ($4) even in the CMSSM. 


A similar sort of cancellation occurs for flavour changing processes, where it is referred 
to as the superGIM mechanism. For example consider the chargino contributions to A My 
shown in Figure 6. 


The Ks — K; mass difference is given by 


AMxy = 2Re(Mis) (50) 
where 
2 2 
o2 6 2 D(Mz, , Mz, MẸ MX) 
Mie = 54 Ba MK fk »» »» Aoig Tig Aoi Ar k ^ t J + (51) 
kk' ij My, 
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a, 


B 


Figure 6. Chargino contribution to AMy. 


and 3 
* qp D 
A fik = Vou Vajk m 2 AU py VoaVzp ay (52) 
and where f, f' label flavours, and D is a dimensionless one-loop integral which is of order 


one if the chargino and squark masses are similar. Here the first term in A comes from 
the üdW term and the second from the üdh term in the Lagrangian. 


Again we see that degenerate squark masses leads to suppression because of the uni- 
versality of the diagonalisation matrices. Indeed because only left handed external quarks 
are involved in this particular diagram, the contribution to AM, from the first term in 
the As is proportional to the mass splitting in the first two generations of left handed 
up-squarks which in the CMSSM is driven by the renormalisation group effects of the 


Yukawas: 2 


m, 

AMx x —. (53) 
my 

In addition to this contribution there are contributions involving cross-terms in the As, 

but because we need two cross terms (there are 4 As) they will be of the same order of 


magnitude. This overall suppression is the super-GIM mechanism. 


5 Aspects of CMSSM phenomenology 


The universality of supersymmetry breaking is found not to hold in most string derived 
models and in general supergravity models, and we expect at least some non-universality 
in the flavour structure of the A terms. However because of the super-GIM mechanism, 
the CMSSM is the minimal model from the point of view of avoiding nasty FCNC and 
CP violating contributions. It is important therefore to estimate what the new super- 
symmetric contributions to these processes are likely to be in the CMSSM, in order to 
provide a kind of lower bound on the expectations for new physics. These effects come 
from the running of the renormalisation group equations, whence the flavour dependence 
of the Yukawas enters the supersymmetry breaking. These effects are small and so the 
mass insertion approximation is valid. However, there is no general way to estimate 
the resulting size of the ós induced by the renormalisation group, because the RGEs are 
non-linear and in any case some of the parameters depend on the minimisation of the 
one-loop electroweak potential. Thus a useful approach is to concentrate on the dorni- 
nant contributions in the CMSSM (and indeed Minimal Flavour models). As we have 
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seen, contributions from diagrams involving gluinos are suppressed in minimal flavour 
models (this is a general result) and the neutralino contributions are also suppressed in 
the CMSSM. Hence the dominant contributions are those involving charginos and charged 
Higgs particles. This greatly reduces the number of parameters and means that one can 
survey the parameter space rather generally without having to perform the minimisation 
of the one-loop potential. 


A study along these lines was carried out in [7] and recently improved and updated 
in [8] where many more rare decays were also considered. (This reference provides a 
useful compendium of expressions for calculating various rare processes in SUSY, and 
summarises the new operators that are introduced into the basis for Her. Many of the 
expressions for the supersymmetric contributions are also found in [9].) Here the dis- 
cussion will concentrate on the supersymmetric contributions to the Wilson coefficients 
calculated at ^ My, so the reader is referred to that reference for more information on 
how to turn these into the measured parameters.) The later study includes NLO-QCD 
and QED corrections as well as constraints from electroweak precision studies, exclusion 
bounds on Mp, and the B — X,y decay rate. The input parameters required for such a 
study are only those that effect the chargino and Higgs masses and mixings: 


mg, my, 67, my o tan B, Tigz, p, My (54) 


where mz, are the heavy and light stop masses, 6; is the stop mixing angle, my the 


lightest chargino mass, and m; the common slepton mass (for the rare decays). Note that 
there are more parameters here than in the CMSSM (including the breaking of electroweak 
symmetry), but in performing a study using these parameters one is addressing a wider 
class of models, including all MSSM models that have Minimal Flavour Violation (for 
example those in which the scalar masses are flavour independent but not necessarily 
degenerate). One then scans over reasonable values of these parameters. Again what 
constitutes ‘reasonable’ is open to interpretation. However, imposing masses less than 
1TeV seems appropriate in order to avoid reintroducing fine tuning. The parameter |] 
can typically be larger than the range |u| < 500GeV chosen in this study, although larger 
values reduce the mixing in the chargino and therefore its contributions. In addition values 
of large tan 8 > 6 are excluded in this study. Although the results tend to asymptote for 
medium tan @ there is a region of very high tan 3 which is interesting from a theoretical 
point of view; it allows Yukawa unification at the GUT scale. Unfortunately this implies 
large b-quark Yukawas and hence the top-quark dominance approximation made for the 
low-medium tan ĝ studies is no longer valid. 


Before summarising the results, let us briefly discuss the main effects of supersym- 
metry. For the kaons, the most interesting effect is that one tends to find a depletion of 
é'/e, unfortunately contrasting with what we would perhaps like given the fact that the 
measured value is slightly larger than one might expect from the Standard Model (modulo 
uncertainties in hadronic matrix elements). The dominant contributions are from Z? and 
photo-penguin diagrams involving charginos and charged Higgs. The two contributions 
come with opposite sign (the chargino being positive), so that there are indeed regions 
where an enhancement over the SM value can occur, although this is only of the order 
of 10%. On the other hand the depletion can be up to 40%. A summary of the most 
interesting processes is given in Table 5. Again, the result for &'/e is plagued by hadronic 
uncertainties, whereas the rare decays are theoretically much cleaner places to search for 
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Process \Ratio w.r.t SM | max | min 
e'fe 0.53 | 1.07 

Kt > atv 0.65 | 1.02 

Kj, 3 wv 0.41 | 1.03 

Kj > n9ete- 0.73 | 1.10 

B > X,uii 0.73 | 1.34 

B 5 uji 0.68 | 1.53 


Table 5. Ratios of CMSSM to SM results for various processes. 


signals of supersymmetry. 

One additional interesting feature of Minimal Flavour models is that they predict an 
absolute lower bound of sin 28 > 0.42 [9, 8]. Such a lower bound is possible because one 
can generalise the expression for the £ hyperbola to include supersymmetric contributions. 


5 [0 — 2) 4m Fa + 0.31] A* By, = 0.266 (55) 
where 0.31 is the charm contribution. From 
R2 
2 = 1.6- 5 
Ri RF, (56) 


we get 


2n(1—p) | 16 E — 0.317(Fi) 67) 


028 = RE Hio [APB 
This function may now be plotted as a function of the parameter Fy which is a collection of 
all SM and SUSY contributions and is typically of order 1 in the CMSSM. (Note that it is 
not monotonic because the variables are not independent of Fy.) A value of sin 26 less than 
this value would have been the first indication of flavour structure in soft supersymmetry 
breaking and arguably an indirect detection of supersymmetry. Unfortunately, at the 
time of writing, the value of sin 28 coming from B — Ks measurements at Belle and 
BaBar indicate higher values. 


6 Beyond the CMSSM and MFV 


Up to this point we have been looking at the minimal case where all flavour and CP vio- 
lation in the MSSM is attributed to the Yukawa sector. However, an interesting question 
to ask is whether there might be other, non-universal scenarios, in which some of these 
processes are consistent with experimental limits. This is an important question because 
string models do not generally predict the favourable CMSSM scenario very naturally. 


There are a number of ideas for suppressing FCNC and CP processes apart from uni- 
versality. They are: cancellations of SUSY contributions amongst themselves, flavour 
alignment, particular flavour structures (e.g. Hermitian or off-diagonal phases), decou- 
pling and dynamical (ie. derived from minimising a string derived potential). In the 
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next subsection I will discuss cancellations with respect to EDMs and make some general 
comments. Then I will mention decoupling and in the following section discuss some 
recent developments in the last of these, FCNC and CP suppression in dynamical string 
derived patterns of supersymmetry breaking. 


6.1 Non-universality and cancellations 


When the flavour structure is non-universal at the GUT scale there can be cancellations 
to various processes. Intuitively it seems unlikely that they will occur simultaneously for 
different processes, and this indeed turns out to be the case when numerical studies are 
done, as we shall now see. 


Non-universality has been studied as a general idea in [10] but more recently there 
has been a resurgence of interest thanks to 'new' string inspired scenarios, motivated by 
explicit constructions of SM-like models in for example type I models. These models 
can lead to flavour non-universality which is relatively ‘well-controlled’, so naturally there 
has been some speculation as to whether a particular type of flavour non-universality 
might lead to cancellations ‘across the board’ as effectively as universality [11, 12]. The 
answer to this, at least for those cases that have been studied, unfortunately appears to 
be negative as we shall now see. 


The measurables that first received attention in this context were again the EDM’s of 
the neutron and electron, both of which have rather strict experimental bounds. In [11] it 
was noted that for particular patterns of supersymmetry breaking there can be a simul- 
taneous cancellation of these parameters, even if the CP violating phases on the jz term is 
large, much larger than would be allowed in the naive CMSSM with phases. This idea has 
since attracted some attention and has become known as the 'cancellation mechanism'. 


The analysis was based on supersymmetry breaking pattern inspired by so-called D- 
brane models in which the SU(2) gauge group was associated with a different brane from 
the SU(3) and U(1)y groups of the SM. The result is that the supersymmetry breaking 
gauginos masses can have different values at the string scale (assumed to be near the 
GUT scale). On the other hand the A-terms can have a good deal of universality. The 
overall pattern of supersymmetry breaking can therefore be assumed to be identical to 
that of the CMSSM with the exception of the gaugino masses which are of the form: 


Mi= M; = V3m3/2(cos 0) 8, e 
M, = V3m3/2(cos 0)85e?" , (58) 


where ma;? ~ My is the gravitino mass and the angles here are a convenient parameteri- 
sation of the distribution of supersymmetry breaking amongst the different fields. If we in 
addition allow an arbitrary phase, $,, for the u term, then one finds large values of phases 
where the neutron and electron EDMs cancel simultaneously as shown in Figure 7(a). 


Unfortunately, when additional experiments are considered, the cancellation region 
begins to be excluded. In this case the experiment in question is the bound on the mercury 
EDM [6]. This parameter also has a cancellation region, but unfortunately it tends to 
cut across the previous cancellation region, and all three EDMs are only consistent with 
bounds in the trivial small phase region [6]. 
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Figure 7. The cancellation regions for EDMS of the electron (dark vertical curves), 
neutron (lighter vertical curves overlapping those of the electron) and mercury atom (di- 
agonal lines). The parameters are tan 8 = 2, of simultaneous n,e, Hg EDM cancellation 
in mSUGRA for the same tan B and may». 


Note that in contrast, mMSUGRA (CMSSM with phases) has a region of cancellation 
for all three EDMs as shown in Figure 7b which allows any value of $4 but restricts 
$, < 71/100. (The weak constraint on $4 is a result of the small contribution of $4 to 
the EDMs, as explained earlier). Adding a small phase to any of the gauginos separates 
the regions of small n, e and Hg EDMs and destroys this cancellation. In both cases the 
cancellation regions decrease in size at more realistic, larger, tan 3. Thus for this example 
the cancellations are not ‘across the board’. Indeed, given the fact that the cancellation 
regions shift considerably depending on the adopted model of the neutron (valence-quark 
or parton), finding a (non-symmetry based) stringy underlying reason for cancellation 
would in any case be difficult. Although here I have concentrated on EDMs I see no 
reason why the conclusion would be different with any other CP or FCNC process unless 
there was some underlying flavour symmetry. 


6.2 Decoupling 


This possibility is based on the fact that as the supersymmetric particles become heavier 
their contribution to FCNC and CP processes begins to decouple. For example, even if 
one allows O(1) CP violating phases, the contribution to the EDMs will be negligible if 
the SUSY spectrum is sufficiently heavy [13]. Generally, SUSY fermions are required to 
be lighter than the SUSY scalars by, for example, cosmological considerations. So the 
decoupling scenario can be implemented with heavy sfermions only. Note that for FCNCs 
for example, the contribution is still proportional to 


mim (59) 
mi me 


where i and j label different flavours. Thus if, for example, we put down the suppression 
of AMxgysy to large first and second generation masses, we still require a proportionate 
universality in their values. So this scenario is in a sense ‘universality where we need it’. 
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Here the SUSY contributions to the EDMs are suppressed even with maximal SUSY 
phases because the squarks in the loop are heavy and the mixing angles are small. In 
Figure 8 we display the EDMs as functions of the universal scalar mass parameter my for 
the mSUGRA model with maximal CP-phases ¢, = ¢4 = 1/2 and mj. = A = 200GeV. 
We observe that all EDM constraints except for that of the electron require my to be 
around 5TeV or more. The mercury constraint is the strongest one and requires 


(™Mo)decoup!. ~ 10 TeV . (60) 


This leads to a serious fine-tuning problem. Recall that one of the primary motivations 
for supersymmetry was a solution to the naturalness problem. Certainly this motivation 
will be entirely lost if a SUSY model reintroduces the same problem in a different sector, 
i.e. for example a large hierarchy between the scalar mass and the electroweak scale. 
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Figure 8. EDMs as a function of the universal mass parameter mo. 1: electron. 2: 
neutron (chiral model). 3: neutron (parton model). 4: mercury. The ezperimental limit 
is given by the horizontal line. Here tan 8 = 3, mio = A = 200GeV and ¢, = ó4 = 1/2. 


The degree of fine-tuning can be quantified as follows. The Z boson mass is determined 
at tree level by 


1 2 mj = mj, tan? B 2 
2g = tan? 8 —1 (61) 
One can define the sensitivity coefficients [14],[15] 
ô lam 
i= 62 
^ Olna? (62) 


where a; are the high energy scale input parameters such as 71/2, ro, etc. Note that p is 
treated here as an independent input parameter. A value of c; much greater than 1 would 
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indicate a large degree of fine-tuning. The Higgs mass parameters are quite sensitive to 
mg so, for mp=10TeV, we find 


Cmo ~ 5000 (63) 


for the parameters of Figure 8. For the universal scalar mass of 5 (or 3) TeV the sensitivity 
coefficient reduces to 1300 (or 500). This clearly indicates an unacceptable degree of fine- 
tuning. 


This problem can be mitigated in models with focus point supersymmetry, that is 
when mj, is insensitive to mo [15]. However, this mechanism works for mo no greater 
than 2-3TeV which is not sufficient to suppress the EDMs. Another interesting possibility 
is presented by models with a radiatively driven inverted mass hierarchy, i.e. the models 
in which a large hierarchy between the Higgs and the first two generations scalar masses 
is created radiatively [16]. However, a successful implementation of this idea is far from 
trivial [17]. One can also break the scalar mass universality at the high energy scale [18]. 
In this case, either a mass hierarchy appears already in the soft breaking terms or certain 
relations among the soft parameters must be imposed (for a review see [19]). These 
significant complications disfavour the decoupling scenario as a way to solve the SUSY 
CP problem, yet it remains a possibility. 


Concerning the other phenomenological consequences, we remark that the SUSY con- 
tributions to the CP-observables involving the first two generations (such as &,&') are 
negligible, although those involving the third generation may be considerable. The corre- 
sponding CP-phases are constrained through the Weinberg operator contribution to the 
neutron EDM, which typically prohibits the maximal phase nh 7/2 while still 


allowing for smaller O(1) phases. 


6.3 New contributions to FCNC processes 


As well as the question of how SUSY contribution can be generally suppressed with non- 
universality, some attention has been paid to the question of whether non-universality 
might be able to explain excesses of, for example, &'/e. One of the reasons for the em- 
phasis on non-universality is that there are rather general arguments that show that, in 
the absence of a CKM phase, there are no new contributions to £, e'/e or hadronic B? 
asymmetries even with arbitrary soft phases. Thus supersymmetric 


CP violation is intimately connected with flavour non-universality. CP in the presence 
of new flavour structures was reviewed recently in [20], mainly concentrating on enhance- 
ments to £'/e. Contributions come from the chargino diagrams in Figure 9 as well as 
the usual gluino diagrams, and these were recently analvsed in [21]. Thus flavour and 
CP constrain low energy supersymmetry but at the same time provide interesting new 
areas to search for supersymmetry indirectly. Generally we do expect new CP violating 
phases in the SUSY sector; however the conclusion of these studies is that they will not 
produce sizable effects in £ and e'/e unless there is also some flavour structure in the 
SUSY breaking. 
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Figure 9. Chargino contributions to ¢'/e. 


7 The future 


Supersymmetry breaking is clearly the biggest factor in flavour and CP phenomenology 
and yet remains the biggest unknown. Ideally one would like to have a picture of CP 
and flavour from first principles but at the time of writing this seems some way off. 
Nevertheless there are two hints to do with CP violation that some progress might be 
possible. The first hint is that in the SM the CP violation is insufficient to account for the 
observed baryon asymmetry. Secondly, general arguments show that the four dimensional 
CP is a discrete gauge symmetry in string theory. Thus any viable string scenario must 
break CP spontaneously. (See [23] for the original observation, and some more recent 
references on the general connection between supersymmetry breaking and CP.) 


If the CP violation responsible for baryogenesis is associated with the supersymmetry 
breaking sector then at some point in the evolution of the Universe these phases must 
have been quite large. On the other hand the SUSY CP problem seems to indicate that 
today they are small. An interesting idea that connects with the stringy picture of CP 
violation is therefore that the spontaneous breaking of CP happens at the same time as 
supersymmetry breaking, and that at the minimum of the potential the CP phases are 
zero. Thus at the time of supersymmetry breaking there was sufficient time dependent CP 
violation to account for baryogenesis, but today these phases have been set dynamically 
to zero. Some progress in this direction has been made in recent investigations into the 
nature of CP in string theory and the precise way in which it can be violated, either by 
string theory moduli (describing the compactification) or by the Froggatt- Nielsen fields 
which are responsible for flavour structure in effective string models [24, 25]. There it 
was found quite generally that it is quite difficult to generate a flavour and CP structure 
in the Yukawa couplings (by whatever means) without at the same time generating large 
EDMs [25]. This problem, which is at once more specific and more serious than the 
supersymmetric CP problem seems to indicate that there is more to be learnt by adopting 
more of a ‘top-down’ approach. 


Eventually such approaches might shed some light on the origin of CP violation in 
string theory and even provide a direct connection between string theory and experiment. 
Certainly CP and flavour remain two of the most important aspects of supersynimetry 
phenomenology, and as we have seen may allow detection of supersymmetry indirectly 
before the LHC. Beyond that, should supersymmetry be found, experiments directed at 
flavour and CP violating phenomenology will help us to pin down the parameters of the 
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supersymmetry breaking sector and allow us to piece together the type of mechanism 
responsible for breaking and mediating supersymmetry, providing a window onto string 
theory. 


In these lectures I have given a purely personal overview of the subject and have 
omitted many topics, my main aim being to equip the reader with the techniques rather 
than to be comprehensive. To finish, I will try to make some amends by summarising some 
of these developments and directing the reader to relevant works. Muon dipole moments 
have been discussed in [26] where it has been shown that a muon EDM may in the future 
be at the 10-??e.cm level and hence measurable. The relations between dipole moment 
operators have been examined in [27] where it was shown that the muon anomalous 
magnetic moment is correlated with the electron EDMs. The question of (g — 2), in 
mSUGRA with and without phases and universality has been analysed in citearnl. In 
mSUGRA there are correlations between this quantity and B, > p~ u* (see Dedes et alin 
the same reference). Alternative approaches to spontaneous CP violation that concentrate 
instead on CP breaking in extensions of the MSSM with gauge singlet have been considered 
in [29]. Other possibilities for CP violation are reviewed in [22]. The neutrino sector (to 
which I have paid scandalously little attention) deserves a review of its own. As well 
as the direct interest in the flavour structure of the neutrino sector, various patterns of 
supersymmetry breaking in the neutrino sector may lead to observable effects in rare 
decays such as u — ey and p — eee, as well as interesting possibilities for CP and 
leptogenesis. The reader is referred to [30] and references therein. I have also been unable 
to cover R-parity violating models which obviously have interesting implication for CP 
and flavour. For a review see [33]. The masses of the Higgs sector are strongly dependent 
on the CP violating phase in the 4 term (something that was somewhat skated over in 
the text). This is discussed in [32]. Finally there are a host of other interesting ideas and 
I cannot even begin to summarise them. One of many intriguing examples is the idea 
that all fermion masses are actually the result of radiative corrections and therefore soft. 
This is discussed in [34]. 
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Appendix A: Conventions 


We here briefly recall the essential features of the Weyl notation for the discussion of 
Section 1. In the Weyl representation, a four component, Dirac fermion looks like 


Xa 
y= x? . 


where X and X, are two-component spinors (a, 3 = 1,2). The gamma matrices are 


0 o” 
Bd ; 64 
7 (3 0 ) (64) 


where o” = (1, 0t), J” = (1, —a*), and o? are the Dirac matrices. The ys matrix looks like 


wai ( | MI (65) 


so that the chiral projections (given by R/L = (1+ iy;)/2) on the fermions are, 


v= (1) me=(2] (66) 


The suffix ‘c’ refers to charge conjugate, since 
~io?X* 
Ue = -—Yh* = ( ° ; (67) 


so that Xea = -io th. The matrix 


0 1 
Eag = i035 = | i0 (68) 


acts like a metric tensor for the unbarred spinors. For example the invariant mass terms 
are 


wy = XP x, + h.c. 
= FX 4X_ the. 
= (Yh (Yra) + he. (69) 


Because of this we identify the complex conjugate of a field, x*? with Xê. (The last 
line we included to make the point that we may identify the two component spinors in 
terms of the original Dirac ones, so that (Yh). = yfe The dagger we shall reserve to 
indicate an operation on the Dirac spinors.) This means that (nX)! = X7. Similarly, the 
complex conjugates form their own representation of the Lorentz algebra, and have their 


226 Steven Abel 


own Lorentz invariant quantities. To avoid confusion the indices of the latter are written 
with a dot, so that 
= —io?,, (70) 


and, in general, Lorentz invariant quantities are written as 
Xna, X,m^, X"a" TO, Xa nO, s... (71) 


and so on. 


Appendix B: Superspace 


In the text we saw that the transformations of the (9, X, F) supermultiplet, could be 
expressed as the behaviour of a single scalar quantity, the superfield. The eight ‘coor- 
dinates’ (z,0,0) are known collectively as superspace. Supersymmetry transformations 
are equivalent to transformations in 8. Transformations in Minkowski space are gener- 
ated by the momentum P,, which is realised over space-time coordinates as a differential 
operator P, = —i0,, so that if z — z +a, then exp(ia-P) induces the transformation 
$(x) — (x) + a"0,o(x). In fact P, is the conserved charge associated with the transfor- 
mation x > z+ a, which, by Noether's theorem is 


OL 
P = [av f- EL 72 
H af H H ( ) 
where f represents summation over all fields (fermions and bosons) in the Lagrangian. In 
a similar way, transformations in 0 can be written in terms of differential operators acting 
in the superspace; 


bg =EQ+EQ (73) 
where 
| 00 aaa 
Qo = ET — on b On 
a E 59% u9 
Q = 96; 10° o^ Q,. (74) 


Acting (infinitesimally) with these operators on the coordinates gives the following trans- 
formations; 


dex = i008 — i€08 


ó = E 
ó&y = 240. (75) 


It is easy to check that, firstly, 


$(y,0) + 4O(y,0) = (y+ 2i00€,0 + £) 
= O(y,0) + dep + vV 205X + 005¢F (76) 
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so that the differential operators above do indeed produce the infinitesimal transforma- 
tions, and secondly, that the momentum is related to the supersymmetry generators as, 


(Qa. Q4) = 2io" ð, (77) 


all other anti-commutators vanishing. This is one piece of the supersymmetry algebra 
and leads to Equation 7. The remaining pieces are provided by the Poincaré algebra of 
normal special relativity (i.e. the algebra of commutations of the momentum P, and six 
M,, generators of the Lorentz group - three boosts Mio and three angular momentum Mi; 
- which represent currents conserved under a Poincaré transformation z^ — Az" + a^). 
The generator Qa has a current associated with it too, the super-current. This is simply 
the No-ether current for the infinitesimal supersymmetry transformations given above; 


OW, > 

Sou = -v2 (sauer -— Fy Xn (78) 
i 

with Qa = f dz? Sao- 

The algebra that we have described is called N — 1 supersymmetry, because there 
is one generator Qa. Models with N generators are referred to as having N-extended 
supersymmetry, and their multiplets become increasingly complicated. N > 8 is not 
normally considered because it requires helicities of 5/2 and higher. Besides, we also have 
more than one graviton for these cases. N = 4 is the largest value of N that we see coming 
from heterotic strings. However, N — 1 is favoured because it allows chiral generations 


to exist (in the sense that we may have different representations of left and right handed 
fields). 


Appendix C: Gauge invariant supersymmetry 


Using the technology presented in the main text, we are already able to construct gauge 
invariant interactions such as Yukawa couplings using the superpotential. However gauge 
invariance also requires some modification of the kinetic terms, as well as a Yang-Mills 
term. This appendix is devoted to the task of determining gauge invariant expressions for 
these and is included for completeness. I should point out that the calculations required 
to produce the results of this appendix become more tedious as we progress. Because of 
this it is not possible (and would not be very useful) to give their derivation in the same 
detail that was possible for the superpotential, and the main aim will be to present the 
flow of the argument.. 


Gauge invariant kinetic terms 


Our first task is to introduce vector-superfields. Vector superfields are the other convenient. 
way of deriving supersymmetry invariant Lagrangians. Like the chiral-superfields, they 
are dimensionless, but obey 

V (z,0,0) = V(x,0,0). (79) 


In fact (in a roundabout way) we already know how to construct a supersymmetry in- 
variant Lagrangian from such a field; the kinetic energy terms may be derived from the 
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vector superfield EE 
K(2,0,0) = $9. (80) 
Clearly this obeys (79), and Taylor expanded in 6s, it becomes 
K(z,0,9) = 4$*ó- V2(0x + OX) + 006* F 4-800 F* 
— 0c" 0 (xox + i¢d,0* — i$*8,) 


+ 660, (i/V/20,(a"**X.6*)) — 899^ (i/v29,(0%,,X"8)) 
— fi, — = 1l 1 1 
+ 0000 G (IKOX4+ DXOX)~ 5|9?+ 776+ 76° FO+FF*) . (81) 


Notice that the series terminates at the 0000 term, which is the highest possible term. 
Also notice that, apart from some total derivatives, the final term is the kinetic energy 
piece of the Lagrangian. Traditionally, the 0000 term of a vector field is referred to as the 
D-term, so that we may now write the kinetic energy more succinctly as 


Lip = $0 " (82) 


As we have seen this term transforms into a total derivative under a supersymmetry 
transformation. In fact this is a general property of D-terms, which is perhaps not so 
surprising when we realise that D-terms, like F-terms, are auxiliary fields which do not 
propagate. 

In order to show this we need to find the supersymmetry transformations for the 
components of an arbitrary vector superfield, which we can write in its Taylor expanded 
form as, 


V(z,0,9) = C(z)-40C(x) + 0C(z) 
+ 66G(x) + 88G* (x) — 6o^Bu, (2) 
+ i068, (X(2)* 0 5^8,6.) — 00" (Ac (z) - c.a) 
+ 0688 E " 1906) (83) 


The spin one component v^ lends its name to the entire vector superfield. In gauge 
superfields, the components A, \ and v* will be the gaugino, anti-gaugino and gauge boson 
respectively. There are a number of ways to determine the supersymmetry transformations 
of the components in this multiplet. We could perform the same operation as in the text 
for the chiral supermultliplet (i.e. begin with ôC (x£) = £C + £C, and continue up through 
the multiplet, checking at each stage that the supersymmetry algebra operates correctly 
on each component), or we could deduce it by comparison with K (z,0,0), or most simply, 
we could extract the components of 


V (z4Fióc£ —i£c0, O+€, O+€) = V(z,0,8) - &V = V(z,0,0) + (£Q-- £EQ)V (84) 


where Q, Q are the differential operators defined in the Introduction. The result for the 
last three components is 


BALE) = i&LD^ ~ (omes 
évA(z) = -i(£o"X" + A^a"zi) - 0, (£o^ C^ + C") 
óD^(z) = (EoD A^ + 0,A^c"C). (85) 
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The D-term transforms into a total derivative as required. Having determined this fact, 
let us now try to introduce gauge invariance into our kinetic term. Under a gauge trans- 
formation, we wish our supermultiplets to retain all their properties, so that the natural 
extension of the transformation 


$i & = (e^), 0; (86) 


(different A, with Ai; = M TÀ, and A, i,j as group indices) is 


0; 90 = (e^), ; (87) 


where now A is a chiral superfield (remember that the product of two chiral superfields 
is another chiral superfield). From now on we shall drop the explicit gauge indices. The 
first thing to notice is that our kinetic term 


K'(z,0,0) = Bee (88) 


is not gauge invariant, since the two exponentials do not cancel. It can be made gauge 
invariant by sandwiching another superfield with the correct transformation properties, 
between the chiral superfields. This field is the gauge-superfield, and since the kinetic 
term must transform as a vector superfield, it is also a vector superfield. Conventionally 


the gauge superfield is defined as gVi; = gaVaT i, such that the kinetic term is 


K(z, 0,0) = Be?9" $. (89) 


The constant g will turn out to be the conventional coupling constant. For gauge invari- 
ance, under a gauge transformation V must transform so that it cancels the troublesome 
phases in Equation 88. That is, we must have 
e?9V. y gi 29V e, (90) 
We now have nearly all the ingredients we need for our Lagrangian. The kinetic energy 
term is found, as before, by extracting the D-term which transforms as a total derivative, 
but is now gauge invariant too, 


LKE = Be” G| | . (91) 


In order to extract the D-term, we need to expand the exponential To do so, we first 
make a gauge transformation such that C, C, C, G = 0. As we shall see below, by examining 
infinitesimal gauge transformations, one finds that this gauge, the Wess-Zumino gauge, is 
always possible. In the Wess-Zumino gauge, things are particularly simple since, 


Va(z,0,8) = -00 Bo, (1) + B8B (2) — 1890A + 59680D(-x), 
Va Vg = — 560800 a 
VsVaVo = 0 (92) 


so that e799” = 1+ V + V?/2. After some lengthy manipulations and Fierz-rearranging, 
the kinetic terms in the Lagrangian are found to be 


Liz = -—iXo"D,X — (D o) Do + F*F 
+ iv2g (ó* Ax — XXad) + góDo*. (93) 
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Note that there are suppressed indices floating around here. For example we have 
99DO* = Zay gA6;DAT2 0; and gó* AX = Ya gap T, A4X;4 where ga allows for group 
factors and U(1) charges. The derivatives are the usual covariant ones; 


Ding = óiO, + igaT; VAn 
Daas = ÓAnÓ,- gafgcv™. (94) 


In order to maintain the Wess-Zumino gauge, supersymmetry transformations must be 
accompanied by the required gauge transformation. That is, one performs an infinitesi- 
mal supersymmetry transformation on the gauge superfield. This generates new C, C, C, G 
terms which are found to cancel for a certain additional infinitesimal gauge transforma- 
tion on V. 'Thus the purpose of Wess-Zumino gauge is to make explicit all the physi- 
cal degrees of freedom. (Explicitly, the supersymmetry transformations give XC = 0, 
^ = = io Et, ôG = 2£A, and the gauge transformation which cancels it is given by 
= = (0,75 50" £V, —£A).) The combined transformation we shall refer to as a supergauge 
anctor ron ze. Roughly speaking, this changes the derivatives in the supersymmetry 
transformations into covariant ones. That is, for the chiral superfields we have 


ôg o = V2EX 
oOx = iV2o"ED, 6 + V2EF 
SCF = iV 2€a"D,X + 2g OdE, (95) 


and for the gauge boson and gauginos and D-field we have 


DCAC) = i£,D^— porta" Eko 
ô viz) = -i Cý — A^g''zi) 
ESD (z) = (£o"D5uA? + DArPaté) . (96) 


The Yang Mills terms 


This completes the kinetic terms for the matter fields. The final terms that we need are 
the Yang-Mills terms. These can be found as in the text by forming an additional chiral 
superfield YV, (y, 0) out of the three physical fields, A2, Fá, and D^. Since the gaugino 
is the component with lowest dimension, we begin by “defining: 


W,(2,0,0) = —iAa(z). (97) 


This chiral superfield is a function with dimension 3/2 and helicity 1/2, and is the field- 
strength superfield. Its general form may be found by repeating what we did in the main 
text but this time for finite supergauge transformations. Making an arbitrary supergauge 
transformation, and using the transformation rules for the components above we find, 
Wily, O) = ôg W2(z, 0) 
i En 
= if(y) + ba (sep + ye" FA) 80c" DACKE — (98) 


a^ uv 


where 
A A A A A,B 
Fis = Oyu, — 0,v, — gAfBcU, Uy - 
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The required, supergauge invariant, Lagrangian can be formed in the usual way by taking 
the F-component of a combination of these superfields; it is given by 


p) 


1 ~ 1 
ue — i^o" DC AG + 5D^Da (99) 


il 


1 Awe 
fy = DT (wawil, + WWE 


in the Wess-Zumino gauge. The full N = 1 supersymmetric, gauge invariant Lagrangian 
is given by the sum of equations 91,16,99: 


L = Lge + Lin + LyM (100) 


This is the main result of this Appendix. Those who would rather avoid any discussion of 
superfields at all, could simply verify that this Lagrangian is indeed invariant under the 
general supergauge transformations given above. Notice again that the auxiliary D field 
may be easily removed by using its equations of motion, 


D^ = gab} Tidy. (101) 
The remainder of this Appendix expands a little on the role played by the chiral W, field; 


it may be skipped at the reader’s discretion. 


In standard gauge theory, the gauge field is the connection required for covariant. 
derivatives. These are quantities which, under a gauge transformation 


di > & = (°) 65. (102) 
transform covariantly: l 
Dydi (Dé) = (e^), Dud. (103) 


The connection is everything other than the derivative in D, = 0, — igv,, and the trans- 
formation of v, is therefore defined so that 
D, = (e?) D, (e?) (104) 
Quite generally, there is only one non-trivial tensor that can be made from the the con- 
nection, by taking its covariant derivative. This is the curvature, which is given by 


ij jo 


Fy, = D,Ay - D,A,. (105) 


So much for standard Yang-Mills theory, but what about its supersymmetric counterpart? 
The field strength superfield W,, may also be related to the connection in a super/gauge 
covariant derivative (just as the usual field strength F,, is related to the gauge bosons). 


We begin with the derivatives which anti-commute with Q, and Q^ which we shall attempt 
to make gauge covariant; these are 


ð uz 
Da = age tiaa? On 


DÈ = -35 ciat, (106) 
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We can easily verify that (D4, Qa} = {Da Qa} = (D4,Q4) = (D4,Q«) = 0. In addition 


Dy = 0. This means that an alternative (and in fact the usual) definition of a chiral 


superfield is D, &(y,0) = 0. The fact that D,D = 0 gives us a useful chiral projection 
operator, 


I 
P= -QP. (107) 


which, when applied any function of z,0,0, turns it into a chiral superfield (that is a 
function of y and @ only). Now consider the derivative 


D, à. (108) 


This derivative is not gauge covariant. A covariant derivative is one which transforms as 
follows under gauge transformations: 


Vie = Vle = e^(V,9). (109) 
Thus l l 
Vi = eve. (110) 


a> 


By examining the gauge transformations of V in Equation 90, we see that 
Va =e 78" p e?9V (111) 


transforms in the correct way, where D, acts on everything to its right. The connection 
superfield is defined as the difference between this term and Da. i.e. 


Ty = e729" (D, e9V), (112) 
Finally the chiral field W, which we derived earlier is the chiral projection of this: 
Wa = Fra. (113) 


The gauge invariance of the Yang-Mills term is now manifest. 


8 Appendix D: R-symmetry and R-parity 


As well as the terms in the superpotential of the MSSM there are many other Yukawa 
couplings that we could have included but didn’t. For example, 


W problem — pL eH, + NELT EL + DL 'Q 4 A" ea U^ D* DS, (114) 


where a, b, c are colour indices, are all invariant under SU(3), x SU(2); x U(1)y. Including 
all of the these terms would lead to disastrously fast proton decay (suppressed by only the 
inverse squark mass squared). Thus, counting over generation indices, we have to explain 
the absence of 30 possible couplings in the MSSM superpotential. This can be done by the 
introduction of a global U(1)g symmetry known as R-symmetry. An R-transformation 
may be defined as a transformation of the spinor coordinates, 


0 — e? 
8 > ec. (115) 
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Under this transformation we may assign a charge, the R-charge, to each of the chiral 
superfields: 


d 


9 (r, 0,8) = Fi o (x, 0,0). (116) 
For the elements of the supermultiplet this means 


(ài, Xi, Fi] — eli fb, eX e 2 E) 
(o, X, FX} — e P (of eX e^ E). (117) 


Since the elements transform differently this also means that R transformations do not 
commute with supersymmetry transformations (i.e. in order to be consistent the su- 
persymmetry generators in Appendix A must obey Q — e '*Q which may easily be 
checked). Now let us examine which of the terms in the Lagrangian is invariant under 
R-transformations. The $ kinetic term is invariant only if we assign R, = 0 to the gauge 
vector, since Rg = — Rz. Examining the kinetic terms in, for example, Equation 22, we 
find À — e'^A. The supersymmetric Yang-Mills Lagrangian (shown in Appendix C) is 
also invariant under this transformation. This leaves the interaction term which looks like 


Lint = W(®)|, + h.c. 
AijkPiP; Oe |. + he. (118) 


By looking at the transformations in (117), the interaction Lagrangian transforms as 
AijkPiPj Pel pe > eio. Ri-2) Ai D; Oj Pel p (119) 


and therefore only the terms with 


DR =2 (120) 


survive in the R-invariant superpotential. A choice of charges which results in the MSSM 
is 
R; = 3B; + Lj (121) 

where B; and L; are the baryon number and lepton number of ®;. Clearly Equation 120 is 
not satisfied by any of the 30 couplings in Equation 114. When supersymmetry is broken 
(for example by a Majorana mass for the gauginos), a residual discrete symmetry remains 
in the Lagrangian, given by 

fi + (-1)9e3Betb f, (122) 
where f; is an arbitrary field of spin S;. This particular choice of discrete symmetry is 
called R-parity or matter-parity. Other (equally valid) choices are said to be R-parity 
breaking. For example, one particularly interesting choice, called baryon-parity is [35], 


fi 2 (—1)?5 138 p. (123) 


In this case only the first two terms in Equation 114 survive, but nucleon decay is absent. 
This model is interesting because this discrete symmetry can arise in certain string models 
(for more details see [35, 36]). 
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1 The Standard Model and B decays 


Studies of b and c physics are focused on two main goals. The first is to look for new 
phenomena beyond the Standard Model and the second is to measure Standard Model 
parameters including CKM elements and decay constants. These lectures concern “B 
Phenomenology," a topic so broad that it can include almost anything concerning b quark 
decay or production. I will cover an eclectic ensemble of topics that I find interesting and 
hope will be educational. 


1.1 Theoretical basis 


The physical states of the “Standard Model” are comprised of left-handed doublets con- 
taining leptons and quarks and right handed singlets (Rosner 2001) 


u c t 
( ) (:) (o) , UR, dg, CR, SR, tr, bg (1) 
d L s L L 
[2 T T » _ i 
( ) (^) ( | ER, HR TR: Ven, Vur Vr R- (2) 
Ve L Vy L Vr L 


The gauge bosons, W+, y and Z° couple to mixtures of the physical d, s and b states. 
This mixing is described by the Cabibbo-Kobayashi-Maskawa (CKM) matrix (Kobayashi 
1973). 


The Lagrangian for charged current weak decays is 


Las AW. + h.c., (3) 


where 
eL dy, 


JE = (De, Dy, V.) Y"Vuaus | up. | + (Ur, Ex, tL) Y¥Vexm | sr (4) 


TL b, 
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and 
Vaa Vus Vab 
Vexm = | Va. Ves Va |- (5) 
Va Vis Veo 


Multiplying the mass eigenstates (dz, Sz,bL) by the CKM matrix leads to the weak 
eigenstates. Vung is the analogous matrix required for massive neutrinos (we will not dis- 
cuss this matrix any further). There are nine complex CKM elements. These 18 numbers 
can be reduced to four independent quantities by applying unitarity constraints and the 
fact that the phases of the quark wave functions are arbitrary. These four remaining num- 
bers are fundamental constants of nature that need to be determined from experiment, 
like any other fundamental constant such as a or G. In the Wolfenstein approximation 
the matrix is written to order A? for the real part and A‘ for the imaginary part as 
(Wolfenstein 1983) 


1—A?/2 À AX (p — in)(1 — à?/2) 
Vckw = -A 1 — A/2 — iA? M Ad?(1 + ind?) . (6) 
AAXY(1- p — in) —AX 1 


The constants À and A are determined from charged-current weak decays. The measured 
values are À = 0.2205 + 0.0018 and A=0.784+0.043. There are constraints on p and 7 
from other measurements that we will discuss. Usually the matrix is viewed only up to 
order à. To explain CP violation in the K? system the term of order A‘ in Ves is necessary. 


i TN 


H Q! 


v 
Figure 1. Diagram for muon decay. 


1.1.1 Determination of Gr 


Muons, being lighter than the lightest hadrons, must decay purely into leptons. The 
process is 47 — e` TeV, as shown in Figure 1. The total width for this decay process is 
given by 

2 


G 
D,- 192,37 x (phase space) x (radiative corrections). (7) 


Since I, 7, = fi, measuring the muon lifetime gives a direct measure of Gr. 


1.1.2 Determination of |V,,,| 


A charged current decay diagram for strange quark decay is shown in Figure 2. Here 
the CKM element Vas is present. The decay rate is given by a formula similar to Equa- 
tion (7), with the muon mass replaced by the s-quark mass and an additional factor 
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Figure 2. Semileptonic K decay diagram. 


of |V,,[?. Two complications arise since we are now measuring a decay process involving 
hadrons, K^ — z?^e^r rather than elementary constituents. One is that the s-quark mass 
is not well defined and the other is that we must make corrections for the probability that 
the ü-spectator-quark indeed forms a 7° with the u-quark from the s-quark decay. These 
considerations will be discussed in greater detail in the semileptonic B decays section. 
Fortunately there are theoretical calculations that allow for a relatively precise measure- 
ment because they deal with hadron rather than quark masses and have good constraints 
on the form-factors. Using these models we have A = V,, = 0.2205 + 0.0018. 


1.1.3 Using semileptonic B decays to determine V., and V, 


Semileptonic b decays arise from a similar diagram to Figure 2, where the b quark replaces 
the s quark. In this case the b quark can decay into either the c quark or the the u quark, 
so we can use these decays to determine V and V, providing we have three ingredients: 


» B lifetimes, 
» relevant B branching fractions, 


e a theory or model to take care of hadronic physics. 


1.2 Lifetime measurements 


The “b-lifetime” was first measured at the 30GeV e*e^ colliders PEP and PETRA, where 
b-quarks were produced via ete” annihilation into bb pairs. Experiments at these colliders 
measured the average lifetime of all b-hadron species using the distribution of the “im- 
pact parameter", which is the minimum distance of approach of a track to the primary 
production vertex. This distance is related to the lifetime (Atwood 1994). 


A more direct measurement would be to measure the actual decay distance L and the 
momentum of the b hadron. Since L = y@ct, where t is the decay time of the individual 
particle (also called the proper time), and 8 = p/E and y = E/m,, the distribution of 
decay times ¢ can be determined. Events will be distributed exponentially in £ as eti, 
with 7 being the lifetime. Uncertainties come from errors on L, the b hadron momentum, 
and contributions from non b-quark backgrounds. 


Precision lifetimes of individual b-flavored hadrons have been measured at LEP where 
the production process is e* e^ — Z° — bb and at CDF in 1.8TeV pp collisions. Large 
samples of semileptonic decays have been used to determined the B? and B- lifetimes. 
Note that the CPT theorem guarantees that the lifetime of the anti-particles is the same 
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ALEPH 


16 
t (ps) 


Figure 3. Proper time distributions of exclusive semi-leptonic decays from ALEPH. 


as the particles. The decay distributions for two semileptonic B decay channels are 
shown in Figure 3. The B? + D*-£*v channel has mostly signal with some background 
from B^ decays and other backgrounds as indicated in the figure. It takes a great deal of 
careful work to accurately estimate these background contributions. The clear exponential 
lifetime shapes can be seen in these plots. Some data has also been obtained using purely 
hadronic final states (Sharma 1994). 


B hed 1.653+0.028 ps 
p? He 1.548+0.032 ps 
B, | 1.493+0.062 ps 
^, | HH 1.23+0.08 ps 


= +0.34 
E, ELT ————————]4 1.39 s PS 
11 12 13 L4 LS 16 17 18 


Figure 4. Measured lifetimes of different B species. 


A summary of the lifetimes of specific b-flavored hadrons is given in Figure 4 (Groom 
2001). Note that the ratio of B^ to B° lifetimes is 1.074+0.028, a 2.60 difference from 
unity, which is quite significant. Also, the A, lifetime is much shorter than the B° lifetime. 
According to proponents of the Heavy Quark Expansion model, there should be at most 
a 10% difference between them (Bigi 1997). To understand lifetime differences we must 
first analyze hadronic decays. 
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1.3 B decay mechanisms 


Figure 5 shows sample diagrams for B decays. Semileptonic decays are depicted in Fig- 
ure 5(a), when the virtual W^ materialises as a lepton-antineutrino pair. The name 
“semileptonic” is given, since there are both hadrons and leptons in the final state. The 
leptons arise from the virtual W^, while the hadrons come from the coupling of the spec- 
tator anti-quark with either the c or u quark from the b quark decay. Note that the B 
is massive enough that all three lepton species can be produced. The simple spectator 
diagram for hadronic decays (Figure 5(a)) occurs when the virtual W- materialises as a 
quark-antiquark pair, rather than a lepton pair. The terminology simple spectator comes 


W , 
b ds v Ham d u 
= = ü,c 
——— con Ie a es 
- q 
a) simple spectator b) hadronic: color suppressed 
ü vds d u 
c) annihilation d) W exchange 
b - u,c,t d 
_ W 3 Sw. - 
d b 


u,c,t 


e) box: mixing 


f) Penguin 
Figure 5. Various mechanisms for B meson decay. 


from viewing the decay of the b quark, while ignoring the presence of the spectator an- 
tiquark. If the colors of the quarks from the virtual W^ are the same as the initial b 
quark, then the color suppressed diagram, Figure 5(c), can occur. While the amount. of 
color suppression is not well understood, a good first order guess is that these modes are 
suppressed in amplitude by the color factor 1/3 and thus in rate by 1/9, with respect to 
the non-color suppressed spectator diagram. 


The annihilation diagram shown in Figure 5(c) occurs when the b quark and spectator 
anti-quark find themselves in the same space-time region and annihilate by coupling to a 
virtual W-. The probability of such a wave function overlap between the b and ü-quarks 
is proportional to a numerical factor called fg. The decay amplitude is also proportional 
to the coupling V,,. The mixing and penguin diagrams will be discussed later. 


Each diagram contributes differently to the decay width of the individual species, 
although the simple spectator diagram is always expected to be the dominant one. Note 
that for b baryons there are even more diagrams than those shown in Figure 5. Currently 
there is no direct evidence for diagrams (c) and (d), although (c) is expected to be 
responsible for the purely leptonic decay B^ — 7^7. 


The semileptonic decay width, I, is defined as the decay rate in units of inverse 
seconds into a hadron (or hadrons) plus a lepton-antineutrino pair. Decay rates can also 
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be expressed in units of MeV by multiplying by A. The width, Ta, is related to the 
semileptonic branching ratio B, and the lifetime 7 by: 


Pa = Ba ou = By/r. (8) 


The semileptonic width should be equal for all b species. This is true for D° and Dt 
mesons, even though their lifetimes differ by more than a factor of two. Thus, it is 
differences in the hadronic widths among the different b hadrons that drive the lifetime 
differences. 

Let us now consider the case of B^ and B- lifetime differences. There is some indi- 
cation that the B has a shorter lifetime, which would imply that there are more decay 
channels available. Figure 6 shows the color allowed and color suppressed decay diagrams 
for two-body decays into a ground-state charmed meson and a m~. The color suppressed 
diagram only exists for the B^. The relative rate 

[(B- > D?z^) 
T(B° > Dtr-) 


=18+0.3, (9) 


and the same is true for all other similar two-body channels, such as D*p-. Thus we 
would expect, if most B decays are given by these diagrams that the B^ would have a 
shorter lifetime than the P, opposite to what the data suggests. An explanation is that 
this ratio reverses for higher multiplicity decays, but this is an interesting discrepancy 
that needs to be kept in mind. 


b Wa? alm b l Ln. 
Ie 7? - D* (qd) cM 


dl.-; 
IJD” (q-u) aye en 


Figure 6. (left) Spectator diagram for B’ > Dn and B- > D'r. (right) Color 
suppressed spectator diagram for B7 — Dr~ only. 


2 Semileptonic B decays 


2.1 Formalism of exclusive semileptonic B decays 


The same type of semileptonic charged current decays used to find Vas are used to find Vo 
and Væ. The basic diagram is shown in Figure 5(a). We can use either inclusive decays, 
where we look only at the lepton and ignore the hadronic system at the lower vertex. 
or exclusive decays where we focus on a particular single hadron. Theory currently can 
predict either the inclusive decay rate, or the exclusive decay rate when there is only a 
single hadron in the final state. 


Now let us briefly go through the mathematical formalism of semileptonic decays. We 
start with pseudoscalar B to pseudoscalar m transitions. The decay amplitude is given 
by Grinstein (1986) and Gilman (1990): 


A(B > me v) = A (10) 
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where 
LY = & 7" (1 — ys) vv, (11) 


and 
H, = (m|Jhaa(0)[B) = f (PP + p), + f- (PP — p), (12) 


where q? is the four-momentum transfer squared between the B and the m, and P, p are 
four-vectors of the B, m respectively. H, is the most general form of the hadronic matrix 
element. It is written in terms of the unknown f functions that are called “form-factors.” 
It turns out that the term multiplying the f_(q) form-factor is the mass of lepton squared. 
Thus for electrons and muons (but not 7s), the decay width is given by 


d'a. G2)Vyl2K? 
Ta = CEPS e oy (13) 


where 


1 24,2]? 
K= a7 (u$ +m? -gqy — 4m MB) (14) 
is the momentum of the particle m (with mass m) in the B rest frame. In principle, 
dUa/dg? can be measured over all q?. Thus the shape of f,(q°) can be determined 
experimentally. However, the normalisation, f,(0) must be obtained from theory, for 
Vij to be measured. In other words, 


Ts, MiP f Kala? da? (15) 


where g(q?) = f.(q?)/f+(0). Measurements of semileptonic B decays give the integral 
term, ae the lifetimes are pons separately, allowing the product |Vij]?] f. (0)]? to 
be experimentally determined. 


For pseudoscalar to vector transitions there are three independent form-factors whose 
shapes and normalisations must be determined (Richman 1995). 


2.2 Measurement of |V.»| 
2.2.1 Heavy quark effective theory and B > D*£^v 


We can use measurements of exclusive B decays, coupled with “Heavy Quark Effective 
Theory" (HQET) to find V4 (Isgur 1994). We start with a quick introduction to HQET. It 
is difficult to solve QCD at long distances, but it is possible at short distances. Asymptotic 
freedom, the fact that the strong coupling constant o, becomes weak in processes with 
large q?, allows perturbative calculations. Large distances are of the order ~ 1/Agcp ~1 
fm, since Agcp is about 0.2GeV. Short distances, on the other hand, are of the order of 
the quark Compton wavelength; Ag ~ 1/mg equals 0.04 fm for the b quark and 0.13fm 
for the c quark. 

For hadrons, on the order of lfm, the light quarks are sensitive only to the heavy 
quark's color electric field, not the flavor or spin direction. Thus, as mg — oo, hadronic 
systems which differ only in flavor or heavy quark spin have the same configuration of 
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their light degrees of freedom. The following two predictions follow immediately (the 
actual experimental values are shown below): 


mp, — mg, = mp,— mp (16) 
(90.2-:2.5) MeV —— (99.24: 0.5) MeV , 


mh. — mj = mo- mh. (17) 


0.49 GeV? 0.55 GeV?. 


The agreement is quite good but not exceptional. Since the charmed quark is not that 
heavy, there is some heavy quark symmetry breaking. This must be accounted for in 
quantitative predictions, and can probably explain the discrepancies above. The basic 
idea is that if you replace a b quark with a c quark moving at the same velocity, there 
should only be small and calculable changes. 


In lowest order HQET there is only one form-factor function F which is a function of 
the Lorentz invariant four-velocity transfer w, where 


| Mi t Mi. — d 


The point w equals one corresponds to the situation where the B decays to a D* which 
is at rest in the B frame. Here the “universal” form-factor function F(w) has the value, 
F(1) = 1, in lowest order. This is the point in phase space where the b quark changes to 
a c quark with zero velocity transfer. The idea is to measure the decay rate at this point, 
since we know the value of the form-factor, namely unity, and then apply the hopefully 
small and hopefully well understood corrections. 


In HQET, the form factor F can also be applied to the analysis of B — DED, but the 
overall decay rate for the pseudoscalar final state is only 40% of D*£- 7 ,and the decay 
rate vanishes at w = 1 much faster, making the measurement more difficult. 


2.2.20 Detection of B > D*£v 


Since this is a semileptonic final state containing a missing neutrino, the decay cannot be 
identified or reconstructed by merely measuring the 4-vectors of the final state particles. 
One technique used in the past relies on evaluating the missing mass (M M) where 


MM? = (Eg - Ep« ~ E) ~ (pg — Pp« — Px)” (19) 
= M? + M? x + M? — 2Eg5 . (Ep t Ej) + 2E px Ee 
—2pp*'Pe + 2pa:(Ppx + Py). 


For experiments using ete” — T(4S) + BB, the B energy, Eg, is set equal to the beam 
energy, Ebeam, and all quantities are known except the angle between the B direction and 
the sum of the D* and lepton 3-vectors (the last term). A reasonable estimate of M M? is 
obtained by setting this term to zero. The signal for the D*£v final state should appear 
at the neutrino mass, namely at M M? = 0. In an alternative technique M M? is set to 
zero and the angle between the B momentum and the sum of the D* and lepton 3-vectors 
is evaluated as 

2Ep(Epx + Er) - Mj — Mis, 


cos (85.5*;) = 2|pal |(Pp* + Pe )| 


, (20) 
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where M, indicates the invariant mass of the D*-lepton combination. 


A Monte-Carlo simulation of cos (e B.p* d is given for the final state of interest and 
for the main background reaction in Figure 7. For the correct final state only a few events 
are outside the “legal” region of +1, while when there are extra pions in the final state 
the shape changes and many events are below —1. 
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Figure 7. The cosine of the angle between the B momentum vector and the sum of D* 
and lepton momentum vectors for (left) the final state D*év and (right) D* Xtv, where 
X refers to an additional pion. 


Recent CLEO data has been analyzed with such a technique. The data in two specific 
w bins is shown in Figure 8. The final result for all w bins is shown in Figure 9. The 
result is characterised by both a value for F(1)|V4| and a shape parameter p°. 
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Figure 8. The signal and background Figure 9. The CLEO results for D*£v 
contributions in two different w bins for for both B and B^. The curve is a fit 


the final state D*£v. to a shape suggested in (Caprini 1998). 
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Figure 10. The correlation between the slope parameter p? and F(1)|V4|. The contours 
are for a change in the fit X? of one unit. 


2.2.3 Evaluation of Va using B — D*£v 


Figure 10 and Table 1 give recent experimental results on exclusive B > D*év decays. 
The CLEO results are not in particularly good agreement with the rest of the world 
including the BELLE results. 


Experiment F(1)|Va| (x1073) p 
ALEPH (Buskulic 1997) 33.0 + 2.14 1.6 0.74 x 0.3 0.4 
BELLE (Tajima 2001) 354£1.941.9 1.35 0.17 + 0.18 
CLEO (Heltsley 2001) 422+132+1.8 1.61: 0.09 + 0.21 
DELPHI (Abreu 2001) 34.5 t 1.401 2.5 1.24: 0.1 0.4 
OPAL(z£)(Abbiendi 2000) 37942134224 1.2+£02404 
OPAL (Abbiendi 2000) 37.5 X 1.7 E 1.8 1.4 0.2 + 0.2 
Average 37.8 + 1.4 1.37 0.13 


Table 1. Modern determinations of F(1)]|V4| using B — D*£^v decays. 


To extract the value of |V| we have to determine the corrections to F(1) that lower 
its value from unity. The corrections are of two types: quark mass, characterised as 
some coefficient times Agcp/mg, and hard gluon, characterised as 4. The value of the 
form-factor can then be expressed as (Neubert 1996) 


F(1) = na (1 +0- Aqcp/mq + c2- (Agen/me)’ +...) = nall + 6). (21) 


The zero coefficient in front of the 1/mg term reflects the fact that the first order correc- 
tion in quark mass vanishes at w equals one. This is called Luke's Theorem (Luke 1990). 
Recent estimates for 74 and 6 are 0.96740.007 and —0.55+0.025, respectively. The value 
predicted for F(1) is then 0.9140.05. This is the conclusion of the PDG review done by 
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Artuso and Barberio (Artuso 2001). There has been much controversy surrounding the 
theoretical prediction of this number. In the future Lattice-Gauge Theory calculations will 
presumably become more accurate when unquenched calculations become possible. Cur- 
rent lattice calculations are done in the quenched approximation, without including light 
quark loops. They give F(1) = 0.913*0025 + 0.016*0002*0000*0.208. where the uncertain- 
ties come respectively, from statistics and fitting, matching lattice gauge theory to QCD, 
lattice spacing dependence, light quark mass effects and the quenching approximation. 
(Hashimoto 2001). 


Using the Artuso-Barberio value for F(1) we have 
|Va| = (41.5 + 1.5 2.3) x 107%, (22) 


where the first error is experimental and the second the error on the calculation of F(1). 


2.2.4 |V.| from inclusive semileptonic decays 


The inclusive semileptonic branching ratio b — Xév has been measured by both CLEO 
and LEP to reasonable accuracy. CLEO finds (10.49 + 0.17 + 0.43)%, while LEP has 
(10.56 + 0.11 +0.18)%. These are not quite the same quantities as the CLEO number is 
an average over B^ and B? only, while the LEP number is a weighted average over all b 
hadron species produced in Z° decays. Thus using the LEP number one should use the 
average “b quark” lifetime of 1.560--0.014ps. 


Using the Heavy Quark Expansion (HQE) model (Bigi 1997) we can relate the total 
semileptonic decay rate at the quark level to |V]: 


IVa = h(Ai Az, A) x T(b ctv) (23) 


= h(Aj,AÀs, A) x B(b > ctv) x t. 

b 
where B(b — clv) is the inclusive semileptonic branching ratio minus a small b + ulv 
component. To be precise, it is the decay of a B meson to a lepton-antineutrino pair plus 
any charmed hadron. 7 is the lifetime of that particular meson or average lifetime of 
the combination of the b-flavored hadrons used in the analysis, suitably weighted. In the 
HQE model the semileptonic rate is described to order (Aocp/ m)? by three parameters: 


e A, is the kinetic energy of the residual motion of the b quark in the hadron: 
M 
= — (Bio) B(v)), (24) 


M=5 hy(iD)*hy 


e Ag is the chromo-magnetic coupling of the b quark spin to the gluon field: 


do = E (Bt) 


D. - o"! Gh B(s)) (25) 


- A is the strong interaction coupling, given in terms of Mg = (Mg + 3M y«)/4, the 
spin averaged B meson mass, as 


MEN M 
= Mpg - —. 
A B-m,c 2M; (26) 
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These parameters are further related by 


Mg = my +A- Ata) 


Ài — Ag 
my ` 


B (27) 
M px = mt+A-— 


These relations allow us to determine A; from the M p» — Mg mass splitting as 0.12GeV?. 
The function (A1, À?, A) can be calculated from the Heavy Quark Expansion (HQE) 
model. This involves both perturbative and non-perturbative pieces. 


Although we will go through this derivation of V4, there is a disturbing assumption 
of quark-hadron duality; the idea of duality is that if you integrate over enough exclusive 
charm bound states and enough phase space, the inclusive hadronic result will match the 
quark level calculation. However, we do not know what size is the uncertainty associ- 
ated with the duality assumption. In fact, Isgur said “I identify a source of Agcp/mq 
corrections to the assumption of quark-hadron duality in the application of heavy quark 
methods to inclusive heavy quark decays. These corrections could substantially affect the 
accuracy of such methods in practical applications and in particular compromise their 
utility for the extraction of the CKM matrix element Va” (Isgur 1999). 


Let us move to the details of the calculation. In one implementation A, and A are 
derived; the relationship between the inclusive b > cév branching fraction (about 99% of 
b + Xfv) is given as 


(b — Xlv) [1.55 ps 0.5 GeV? 
IVa| = 0.0411, cis N em (1 um Gr) (28) 


x (1 £ 0.0155... + 0.010,,, + 0.012, jms, ) 


(29) 


where x? is the negative of A; modulo QCD corrections and is taken as (0.5 + 0.1) GeV? 
(Bigi 1997). This leads to a value of 


[Væl = (40.7 + 0.5 + 2.4) x 10? (30) 


from the LEP data alone, with a similar value from CLEO. The first error contains the 
statistical and systematic error from the experiments while the second error contains an 
estimate of the theory error from sources other than duality. 


In another implementation the parameters À; and A are obtained from data. Here we 
use the HQE formula 


G? |V|? MŽ 


= . 31 
Ta Te x (31) 
A Lu à do 
Ws 165—- 3.185 40.02% 
h- 1547 “1.6557 (1- 1.087— :) -09575 3. ag 09b | 


Determining À; and A can be accomplished by measuring, for example, the “moments” 
of the hadronic mass produced in b — clv decays. The first moment is defined as the de- 


— L2 
viation from the D mass (Mp) as (M — Mp) and the second moment as (ui — Mp) ) 
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It is also possible to use the first and second moments of the lepton energy distribution 
in these decays, or moments of the photon energy in the process b — sy. In fact any 
two distributions can be used; in practice it will be critical to use all of them to try and 
ascertain if any violations of quark-hadron duality appear and to check that terms of 
order (Agcp/m;)? are not important. 


CLEO has used the first and second moments of the hadron mass in b — clv decays. 
They find the Mx distributions by using missing energy and momentum in the event 
to define the v four-vector. Then detecting only the lepton and requiring it to have a 
momentum above 1.5GeV/c, they calculate: 


Mi = (Es-E- E) - (pg - b - (32) 
= M? + MÈ -2Eg(E,4 E,) + 2pp: (Pe — P) 
& Mic Mj -2EgpEe, (33) 


where Me is the invariant mass of the lepton-neutrino pair. The measured MẸ distri- 
bution is shown in Figure 11. We do not see distinct peaks at the mass of the D and 
D* mesons because ignoring the last term in Equation (32) causes poor resolution. This 
term must be ignored, because using this technique, we do not know the direction of the 
B meson. 


-— 


-4 -2 0 4 6 8 10 
M? (GeV) 


Figure 11. The mass distribution for b — clv events from CLEO. Xy indicates an 
additional pion plus a D or D*. 


CLEO finds values of the first and second moments of (0.287+0.023+0.061) GeV? 
and (0.712+0.056+0.176)GeV‘, respectively. These lead to the determination of A,, A 
and V4 shown in Figure 12. Later we will see a different determination using b — sy 
(Section 5.3.1). 


In summary the exclusive measurements of Va are to be trusted while the inclusive 
determination, though consistent, has an unknown source of systematic error and should 
not be used now. 
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Figure 12. Constraints in Aj versus A from CLEO measurements of first and second 
hadronic moments in semileptonic decay. The darker band gives ezperimental uncertain- 
ties alone, while the lighter outer band also includes uncertainties from unknown 3rd order 
theoretical parameters. Bands of constant Va and b quark mass, ma, are also shown, where 
the band widths represents theoretical uncertainties due to unknown 3rd order parameters. 


2.3 Measurement of |V,| 


This is a heavy to light quark transition where HQET cannot be used directly as in finding 
Væ. Unfortunately the theoretical models that can be used to extract a value from the 
data do not currently give precise predictions. 


Three techniques have been used. The first measurement of V, done by CLEO (Fulton 
1990) and subsequently confirmed by ARGUS (Albrecht 1990), used only leptons which 
were more energetic than those that could come from b > c& p decays. These “endpoint 
leptons” can occur b — c background free at the Y (4S), because the B’s are almost at rest. 
The CLEO data are shown in Figure 13. Since the lepton momentum for B — Dév decays 
is cut off by phase space, this data provides incontrovertible evidence for b > uév decays. 


Unfortunately, there is only a small fraction of the b — u£ 7» lepton spectrum that 
can be seen this way, leading to model dependent errors. The models used are either 
inclusive predictions, sums of exclusive channels, or both: see Isgur (1995), Bauer (1989), 
Kórner (1989), Melikhov (1996), Altarelli (1982) and Ramirez (1990). The average among 
the models is |Vip/Ve5| = 0.079 + 0.006, without a model dependent error. These models 
differ by at most 1196, making it tempting to assign a +6% error. However, there is no 
quantitative way of estimating the error. 


ALEPH (Barate 1999), L3 (Acciarri 1998) and DELPHI (Abreu 2000) isolate a class 
of events where the hadron system associated with the lepton is enriched in b — u and 
thus depleted in b — c. They define a likelihood that hadron tracks come from 5 decay by 
using a large number of variables including, vertex information, transverse momentum, 
not being a kaon etc.. Then they require the hadronic mass to be less than 1.6GeV, which 
greatly reduces b — c, since a completely reconstructed b — c decay has a mass greater 
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Figure 13. Sum of inclusive electron and muon distributions from CLEO. The solid 
points are data taken on the peak of the Y(4S) while the open circles are data taken on 
the continuum 30MeV below the resonance(suitably normalised). The dashed line is a fit 
to the continuum data and the solid line is the predicted curve from b — clv dominated 
by B — Dév near the end of the allowed lepton spectrum. 
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Figure 14. The lepton energy distribution in the B rest frame from DELPHI. The data 
have been enriched in b — u events, and the mass of the recoiling hadronic system is 
required to be below 1.6GeV. The points indicate data, the light shaded region, the fitted 
background and the dark shaded region, the fitted b > u£v signal. 


than that of the D (1.83GeV). They then examine the lepton energy distribution, shown 
in Figure 14 for DELPHI. 

The average of all three results is |V,,| = (4.13*022^ 043-023 +0.20) x 1075, resulting in a 
value for |Vi»/Ves| = 0.102 +0.018, using |V| = 0.0405 + 0.0025. I have several misgivings 
about this result. First of all the experiments have to understand the systematic errors 
very well. To understand semileptonic 6 and c decays and thus find their b > ulv 
efficiency, they employ different models and Monte Carlo manifestations of these models. 
To find the error they take half the spread that different models give. This alone may be 
a serious underestimate. Secondly they use one model, the HQE model, to translate their 
measured rate to a value for |V,,,|. This model assumes duality, and there are no successful 
experimental checks: The model fails on the A, lifetime prediction. Furthermore, the 
quoted theoretical error, even in the context of the model, has been estimated by Neubert 
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to be much larger at 1096 (Neubert 2000). Others have questioned the effect of the hadron 
mass cut and estimate 10-2096 errors due to this alone (Bauer 2001). 


It may be possible to use the spectrum of photons in b — sy to reduce the theoretical 
error in the endpoint lepton method or to make judicious cuts in q? instead of hadronic 
mass to help reduce the theoretical errors: see Wise (2001). 


The third method uses exclusive decays. CLEO has measured the decay rates for the 
exclusive final states 7@v and plv (Alexander 1996). The data are shown in Figure 15. 
The model of Körner and Schuler (KS) is ruled out by the measured ratio of p/m. Other 
models include those of Isgur (1995, 1989), Wirbel (1985), Bauer (1989), Korner (1988), 
Melikhov (1996), Altarelli (1982) and Ramierz (1990). CLEO has presented an updated 
analysis for plv where they have used several different models to evaluate their efficiencies 
and extract Vi». These theoretical approaches include quark models, light cone sum rules 
(LCRS), and lattice QCD. The CLEO values are shown in Table 2. 
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Figure 15. The B candidate mass distributions and the signal bin lepton momentum 
spectra (insets) for the pion modes (top) and the sum of p and w (vector) modes (bottom). 
The points are the data after continuum and fake background subtractions; the dark- 
shaded, cross-hatched and unshaded histograms are b > cX, b — ulv feed-down and 
signal, respectively. For the x (vector) modes, the light-shaded and hatched histograms 
are T — T (vector vector) and vector — v (n — vector) cross-feed, respectively (charge 
final states can feed neutral and vice-versa). The histogram normalisations are from the 
nominal fit. The arrows indicate the momentum cuts. 
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Model Vap (x10) 

ISGW2 (Isgur 1989) 3.23 + 0.14*02 
Beyer/Melikhov (Beyer 1998) 3.32 + 0.15*02) 
Ligeti/Wise (Ligeti 1996) 2.92 + 0.13*030 
LCSR (Ball 1998) 3.45 + 0.15*03; 
UKQCD (Debbio 1998) 3.32 + 0.14*031 


Table 2. Values of |V,j| using B — p£^v and some theoretical models. 


The uncertainties in the quark model calculations (first three in the table) are guessed 
to be 25-5096 in the rate. The Ligeti/Wise model uses charm data and SU(3) symmetry 
to reduce the model dependent errors. The other models estimate their errors at about 
30-50% in the rate, leading conservatively to a 25% error in |Vas|. Note that the models 
differ by 1896, but it would be incorrect to assume that this spread allows us to take a 
smaller error. It may be that the models share common assumptions, e.g. the shape of the 
form-factors. At this time it is prudent to assign a 2596 model dependent error realising 
that the errors in the models cannot be averaged. The fact that the models do not differ 
much allows us to comfortably assign a central value |V,»| = (3.252-0.14*02 +0.80) x 107?, 


and a derived value |Vus/ Væl = 0.08 + 0.02 . 


Lattice QCD has predicted form-factors and resulting rates for the exclusive semilep- 
tonic final states £r and pév (Sachrajda 1999) in the quenched approximation. These 
calculations require the momentum of the final-state light meson to be small in order to 
avoid discretisation errors. This means we only obtain results at large values of the invari- 
ant four-momentum transfer squared, q?. Figure 16 shows the predictions of the B — plv 
width as a function of q?. Note that the horizontal scale is highly zero suppressed. The 
region marked “phase space only” is not calculated but estimated using a phase space 
extrapolation from the last lattice point. 


3 -^ —— — — 


nm 


EN 


dr/dg (ps /GeV^y|V, 


qIGeV? 


Figure 16. The UKQCD lattice calculation for B — plv shown as circles. The line is 
an estimate. 


The integral over the region for q? > 14GeV? gives a rate of AT = 8.3 |V|? ps-!-GeV?. 
CLEO measurements in the same interval give (7.12:2.4) x 10 5ps-!-GeV?, yielding a value 
for Va = (2.92: 0.5) x 107? (Sachrajda 1999). Ultimately unquenched lattice calculations 
when coupled with more precise data will yield a much better value for Vis. 
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3 Facilities for b studies 


3.1 b production mechanisms 


Although most of what is known about b physics has been obtained from e*e^ colliders 
operating either at the Y (4S) or at the Z^ pole, interesting information is now appearing 
from the hadron collider experiments, CDF and D0, which were designed to look for 
considerably higher energy phenomena. The appeal of hadron colliders arises mainly 
from the large measured b production cross-sections. At the T (4S) the total b production 
cross-section is only 1.05 nb, whereas at the FNAL Tevatron collider, with 1.8TeV in the 
pp center-of-mass, the b cross-section has been measured as ~100yb. Tt is expected to be 
about five times higher at the LHC (Artuso 1994). 


The different production mechanisms of b quarks at various accelerators leads to dif- 
ferent methods of measurement. Figure 17 shows the production of B- and B" mesons at 
the Y (45), while Figure 18 shows the production mechanism of the different b species at a 
higher energy ete” collider such as LEP. Figure 19 shows the production mechanisms for 
a b or c quark at a hadron collider. The third order diagrams appear to be as important 
as the second order diagrams and the overall theoretical calculation gives about 1/2 of 


the measured value. 
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Figure 17. B production at the Y (45). 
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Figure 18. b production in the continuum at e*e^ colliders. 
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Figure 19. Heavy quark, Q, production at hadron colliders. Left four diagrams are 
second order in as, while right four are third order. 
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3.2 Accelerators for b physics 


Experiments on b decays started with CLEO and ARGUS using e*e™ colliders operating 
at the Y(4S). They were quickly joined by the PEP and PETRA machines operating 
around 30GeV. Table 3 lists some of the machines used to study b quarks in the last 
century (Artuso 1994). 


Machine Beams Energy o(b) b fraction £ Total # 
(GeV) cm ?s^! — b-pairs 

CESR ete 10.8 1.05 nb 0.25 13x10? 9.8 x 108 
DORIS ete” 10.8 1.05 nb 0.23 ~ 10%! — 0.4 x 108 
PEP ete” 29 0.4 nb 0.09 3.2x10?! 

PETRA ete” 35 0.3 nb 0.09 1.7x10! 

LEP ete 91 9.2 nb 0.22 2.4x10? 1.8 x 10° 
SLC ete” 91 9.2 nb 0.22 3.0x10% 8.8 x 104 


TEVATRON pp 1800 100 ub 0.002 3x10?! 


Table 3. Machines used for b physics in the 20th century. The total number of bb pairs 
accumulated per experiment is also listed when known. 


In the year 2000 the PEP II and KEK-B storage rings began operation. These ma- 
chines have separate e~ and e* magnet rings so they can operate at asymmetric energies; 
PEP II has beam energies of 9.0GeV and 3.1GeV, while KEK-B has energies of 8.0GeV 
and 3.5GeV. This allows the B mesons produced at the {(4S) to move with a velocity 
B ~ 0.6, which turns out to be very important in measurements of CP violation, since 
time integrated CP violation via mixing must be exactly zero due to the C odd nature of 
the Y(4S). These machines are operating at very high luminosities. Current and future 
machines for B physics are listed in Table 4. The CDF and DO experiments will continue 
at the Tevatron with higher luminosities. CDF has already made significant contributions 
including studies of b production, lifetimes and the discovery of the B; meson (Abe 1998). 


Machine Exp. Beam Energy a(b) b £(Design) Interactions 
(GeV) fraction  cm^?s ^! per crossing 
PEPII BABAR ete 108 1.05nb 1/4  3xl10?(l) «1 
KEK-B BELLE ete- 10.8 1.05 nb 1/4 1094 «1 
HERA HERA-b pN 800 10nb 2.1079 4 
Tevatron BTeV pp 2000 100 ub 1/500 2x10? 2 
LHC LHCb Dp 14000 500 ub 1/160 2x 10°? 0.6 


1 Machine has already exceeded design luminosity. 


Table 4. Machines in use or approved for dedicated b physics experiments. 
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BTeV and LHCb will go into operation around 2007 with much larger event rates. The 
CMS and ATLAS experiments at the LHC will also contribute to b physics especially in 
the early stages when the luminosity will be relatively low; at design luminosity these ex- 
periments have an average of 23 interactions per crossing making the finding of detached 
vertices difficult. 


3.3 ete- detectors 


Most experiments at ete” storage rings look quite similar. CLEO II, shown in Figure 20, 
was the first detector to have both an excellent tracking system and an excellent elec- 
tromagnetic calorimeter. Starting from the inside there is a thin beryllium beam pipe 


Muon Chambers 
Outer Iron 


Inner Iron Magnetic Coil 
Barrel Crystals 


Time of Flight 


Return Iron 


Central ` 
Dritt Chamber 


Beam Pipe 
Silicon Vertex Detector 


Figure 20. Electrons view of the CLEO II detector. 


surrounded by a silicon vertex detector; this detector measures positions with an accu- 
racy of zz 10um. Then there is a wire drift chamber whose main function is to measure 
the curving trajectories of particles in the 1.5T solenoidal magnetic field. The next device 
radially outward is time-of-flight system to distinguish pions, kaons and protons. This 
system only works for lower momenta. The next device is an electromagnetic calorimeter 
that uses Thallium doped CsI crystals; indeed this was the most important new technical 
implementation done in CLEO II and has also been adopted by BABAR and BELLE. 
Afterwards there is segmented iron that serves as both a magnetic flux return and a filter 
for muon identification. 


The most important advance in the new CLEO III, BELLE and BABAR detectors 
is much better charged hadron identification. Each experiment uses different techniques 
based on Cherenkov radiation to extend x/K separation up to the limit from B decays. 
Figure 21 shows a view of the BELLE detector parallel to the beam. 
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Figure 21. The BELLE detector. 


3.4 b production characteristics at hadron colliders 


To make precision measurements, large samples of b’s are necessary. Fortunately, these are 
available. With the Fermilab Main Injector, the Tevatron collider will produce œ% 4 x 10! 
b hadrons/10’ s at a luminosity of 2 x 10°2cem~*s~!. These rates compare very favorably to 
ete” machines operating at the Y (4S). At a luminosity of 10*4cm~?s~! they would produce 
2x 109 B’s/10’ s. Furthermore B,, Ay and other b-flavored hadrons are accessible for study 
at hadron colliders. The LHC has about a five times larger b production cross-section 
than the Tevatron. Also important are the large charm rates, ~10 times larger than the 
b rate. 


In order to understand the detector design it is useful to examine the characteristics 
of b quark production at pp collider. It is often customary to characterise heavy quark 
production in hadron collisions with the two variables p, and 7, where n = — In (tan (6/2)), 
and @ is the angle of the particle with respect to the beam direction. According to QCD 
based calculations of b quark production, the B's are produced “uniformly” in 7 and 
have a truncated transverse momentum spectrum, p, characterised by a mean value 
approximately equal to the B mass (Artuso 1994). The distribution in 7 is shown in 
Figure 22(a). 

The flat 7 distribution hides an important correlation of bb production at hadronic 
colliders. In Figure 22(c) the production angles of the hadron containing the 6 quark is 
plotted versus the production angle of the hadron containing the b quark according to 
the Pythia generator. Many important measurements require the reconstruction of a b 
decay and the determination of the flavor of the other b, thus requiring both b's to be 
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Figure 22. (a) The B yield versus q. (b) By of the B versus n. (c) The production 
angle (in degrees) for the hadron containing a b quark plotted versus the production angle 
for a hadron containing b quark. 


observed in the detector. There is a very strong correlation in the forward (and backward) 
direction: when the B is forward the B is also forward. This correlation is not present in 
the central region (near 90°). By instrumenting a relative small region of angular phase 
space, a large number of bb pairs can be detected. Furthermore the B's populating the 
forward and backward regions have large values of 8y Figure 22(b). 


B TeV is a dedicated heavy flavor experiment approved to run at the Fermilab Tevatron 
collider, uses two forward spectrometers (along both the p and p directions) that utilise 
the boost of the B's at large rapidities. This is of crucial importance because the main way 
to distinguish b decays is by the separation of decay vertices from the main interaction. 
LHCb, approved for operation at the LHC, needs a larger detector to analyze the higher 
momentum decay products, and thus has only one arm. 


3.4.1 The BTeV detector description 


I will describe BTeV here though LHCb shares many of the same features. There are 
difficulties that heavy quark experiments at hadron colliders must overcome. First of all, 
the huge b rate is accompanied by an even larger rate of uninteresting interactions. At 
the Tevatron the b-fraction is only 1/500. In addition all b experiments have problems 
searching for rare decay processes, at the level of parts per million, where the background 
from other b decays is dominant. To observe such decays the large data rate of b's must be 
handled. For example, BTeV, has 1 kHz of b’s into the detector, and these events must be 
selected and written out. The electromagnetic calorimeter must be robust enough to deal 
with the particles from the underlying event and still have useful efficiency. Furthermore, 
radiation damage can destroy detector elements. 


The BTeV Detector is shown in Figure 23 (Skwarnicki 2001) and the LHCb detector 
in Figure 24 (Muheim 2001). The central part of the BTeV detector has a silicon pixel 
detector inside a 1.5T dipole magnet. The LHCb experiment uses silicon strips. The BTeV 
pixel detector provides precision space points for use in both the offline analysis and the 
trigger. The pixel geometry is sketched in Figure 25(a). Pulse heights are measured 
on each pixel. Prototype detectors were tested in a beam at Fermilab, and excellent 
resolutions were obtained (Figure 25(b)), especially when reading out pulse heights (Appel 
2001). The final design uses a 3-bit ADC for this. 
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Figure 23. Schematic of the BTeV detector. 
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Figure 24. A schematic diagram of the LHCb detector . Key: 1, verter detectors; 
2,aerogel and gas RICH's; 3, magnet yoke; 4, coils; 5, magnetic shielding; 6, tracking 
chambers; 7, gas RICH; 8, EM-calorimeter; 9, hadron calorimeter; 10, muon system. 


PT 
E 50 um 
(a) Doublet position (b) S CM E 
c 
along beam E aniio 
312 
meer v 2 bit ADC 
a 


i 
Ej 


(—) Used in MonteCarlo 
B L . 


Beams 


-0.1 o 0.1 0.2 0.3 0.4 0.5 0.6 
Track Angle (rod) 


Figure 25. (a) Pizel detector geometry in BTeV. The detector is inside the beam pipe. 
(b) The spatial resolution as a function of the incident track angle for both 2-bit and 8-bit 
ADC's as measured in an 800GeV/c pion beam. The straight lines are piecewise fits to 
the data used in the Monte Carlo simulation. The dashed line near the top indicates the 
resolution obtainable without using pulse height information. 
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The pixel tracker provides excellent vertex resolution which is good enough to trigger 
on events with detached vertices characteristic of b or c decays. BTeV shows a rejection of 
100:1 for minimum bias events in the first trigger level while accepting about 5096 of the 
usable b decays. A good explanation of the trigger algorithm can be found at the website 
ww w-btev.fnal.gov/public documents/animations/Animated Trigger/. ^ Further trigger 
levels reduce the background by about a factor of twenty while decreasing the b sample 
by only 1096. The trigger system stores data in a pipeline that is long enough to ensure 
no deadtime. The data acquisition system has sufficient throughput to accommodate an 
output of 1kHz of 6’s, 1kHz of c’s and 2kHz of junk. Tracking is accomplished using straw 
tube wire chambers with silicon strip chambers in the high track density region near the 
beam. 


Charged particle identification is done using a Ring Imaging CHerenkov detector. A 
gaseous C4F;o radiator is used with a large mirror that focuses light onto a plane of 
photon detectors; these are currently Hybrid Photo-Diodes. They have a photocathode 
and a 20KV potential difference between the photocathode and a silicon diode that is 
segmented into 163 individual pads. The photoelectron is accelerated and focused onto 
the diode yielding position information for the initial photon. The system will provide four 
standard-deviation kaon/pion separation between 3-70GeV /c, electron/pion separation 
up to 22GeV /c and pion/muon separation up to 15GeV/c. Because protons don't radiate 
until 9GeV/c they can't be distinguished from kaons below this momenta. BTeV is 
considering an additional liquid CgF 4 radiator, 1cm thick, in front of the gas along with 
a proximity focused phototube array adjacent to the sides of the gas volume, to resolve 
this ambiguity. 

BTeV also has an excellent Electromagnetic calorimeter made from PbWO, crystals, 
based on the design of CMS. Finally, the Muon system is used to both identify muons 
and provide an independent trigger on dimuons (BTeV 2000). 


3,0 * . 
4 B?—B mixing 
4.1 Introduction 
Neutral B mesons can transform to their anti-particles before they decay. The diagrams 
for this process are shown in Figure 26 for the By. There is a similar diagram for the B,. 
Although u, c and t quark exchanges are all shown, the £ quark plays a dominant role 


mainly due to its mass, as the amplitude of this process is proportional to the mass of 
the exchanged fermion. 


b d b tou. d 
d ted ^ leu d b 
j LE J 


Figure 26. The two diagrams for By mizing. 


Under the weak interactions the eigenstates of flavor, which are degenerate in pure 
QCD, can mix. Let the quantum mechanical basis vectors be (|1),|2)) = {|B°), [B^)). 
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Then the Hamiltonian is 


i M M i(T T 
u-M-z-[.. "jJ-i|. P]. (34) 
2 My, M Dn, T 


[S se 


Diagonalising we have 
Am = mg, — mg, = 2|Myj|. (35) 


Here H refers to the heavier and L to the lighter of the two weak eigenstates. 

Bg mixing was first discovered by the ARGUS experiment (Albrecht 1983), although 
there was a previous measurement by UAI indicating mixing for a mixture of B} and B? 
(Albajar 1987). At the time it was quite a surprise, since m, was thought to be in the 
30GeV range. It is usual to define R as probability for a B? to materialise as a B? divided 
by the probability it decays as a B°. The OPAL data for R (Akers 1995) are shown in 
Figure 27. 
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Figure 27. The ratio, R, of like-sign to total events as a function of proper decay time, 
for selected B —^ D** Xt v events. The jet charge in the opposite hemisphere is used to 
determine the sign correlation. The curve is the result of a fit to the mixing parameter. 


Data from many experiments has been combined by a LEP Working Group to ob- 
tain an average value of Ama = (0.489 + 0.0008) x 10!?hs^!. Values from individual 
experiments are listed in Figure 28. 
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T T T 77 T T T 
ALEPH ' — p 0.446+0.020+0.018 ps”! 
(3 + 1 prel) Yj 
DELPHI * - 0.504+0.019+0.012 ps 
(4 +1 prel) 
L3 —- 0.444:0.028:0.028 ps" 
(3) 
OPAL z 0.479+0.018+0.015 ps” 
(5 
Bu i 0.507::0.023:£0.019 ps” 
pre 
a RN m 0.495+0.026+0.025 ps” 
+2 pre 
BABAR * R 0.503+0.009+0.011 ps” 
(2 prel) 


BELLE 


0.463+0.008+0.016 ps” 
(1) 


^ 


average of above 
after adjustments 


ARGUS+CLEO 
(Xa measurements) 


0.489-40.009 ps” 


0.489+0.032 ps! 


world average 0.489+0.008 ps! 


" working group average 4 
without adjustments Am, (ps ) 


Figure 28. Values of the Ba mizing parameter Ama for each experiment. 


The probability of mixing is given by (Gaillard 1974) (Bigi 2000) 


Am (G 


2 m2 
T= ond Bo frasrolVi Vani F (i Nacps (36) 


where Bg is a parameter related to the probability of the d and b quarks forming a 
hadron which must be estimated theoretically, F is a known function which increases 
approximately as m?, and rjgcp is a QCD correction, with a value of about 0.8. By far 
the largest uncertainty arises from the unknown decay constant, fg. In principle fg can 
be measured, since the decay rate of the annihilation process B^ — f^ is proportional 
to the product of f2|V.4?. In practice, even if V, were well known, this is a very difficult 
process to measure. Our current best hope of determining fg is to rely on unquenched 
lattice QCD which can use the measurements of the analogous Dt — u*v decay as check. 
Measurements of these D decays requires the construction of a ^r-charm factory." 


Since the Ba mixing measurement determines: 


[Vis Veal? x |(1 — p — in)? = (p - 1? +r, (37) 


B phenomenology 263 


it can be related to the CKM parameters; giving a circle centered at (1,0) in the p—7 
plane. This could in principle be a very powerful constraint. Unfortunately, the parameter 
Bg is not experimentally accessible, and fg, although in principle measurable, has not 
been and may not be for a very long time, so it too must be calculated. The errors on 
these calculations are still quite large. 


4.2 B, mixing in the Standard Model 


B? mesons can mix in a similar fashion to BY mesons. The diagrams in Figure 26 are 
modified by substituting s quarks for d quarks, thereby changing the relevant CKM matrix 
element from Vig to Vis. The time dependent mixing fraction is 

Am, G 2 


2 m 
ts = E = 8:3 Bg, flans avant (355) NQCDs (38) 


which differs from Equation (36) by parameters relevant for the B, rather than the Bg. 


Measuring x, allows us to use ratio of rg/z, to provide constraints on the CKM 
parameters p and 7. We still obtain a circle in the (p,n) plane centered at (1,0): 


Vial? = Æ [(1 — p)? + n] (39) 
V 2 
is 


However we must calculate only the SU(3) broken ratios Bpg,/Bp, and f5,/fp,. 


B? mixing has been searched for at LEP, the Tevatron, and the SLC. The probability, 
P(t) for a B, to oscillate into a B, is given by: 


P(t) (B, ^ B.) = Pret" [1 + cos (Amst)], (40) 


where t is the proper time. 


To combine different experiments a framework has been established where each ex- 
periment finds amplitudes A for a spectrum of test oscillation frequencies, w, defined 
as 

1 
P(t) = ghe [1 + Acos (wt)]. (41) 


Figure 29 shows the world average measured amplitude A as a function of the test 
frequency w = Am, (Leroy 2001). For each frequency the expected result is either zero for 
no mixing or one for mixing. No other value is physical, although allowing for measure- 
ment errors other values are possible. The data do indeed cross one at a Am, of 16ps-!, 
however here the error on A is about 0.6, precluding a statistically significant discovery. 
The quoted upper limit at 95% confidence level is 14.6 ps^!. This is the point where the 
value of A plus 1.645 times the error on A reaches one. Also indicated on the figure is the 
point where the error bar is small enough that a 4c discovery would be possible. This is 
at Am, = llps™!. One should also be aware that all the points are strongly correlated. 


The upper limit on Am, translates into an upper limit z, «21.6, also at 95% confidence 
level. CDF plans to probe higher sensitivity and eventually LHCb and BTeV can reach 
values of ~80. 
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Figure 29. Combined experimental values of the amplitude A versus the test frequency 
w = Amg as defined in Equation 41. The inner (outer) envelopes give the 95% confi- 
dence levels using statistical (statistical and systematic) errors. The “sensitivity” shown 
at 18.3ps^! is the likely place a 95% c.l. upper limit could be set. Also indicated is the 
mazimum value, 11ps~!, where a 4a discovery would be possible. 


5 Rare b decays 


5.1 Introduction 


These processes proceed through higher order weak interactions involving loops, which are 
often called “penguin” processes, for unscientific reasons (Lingel 1998). A Feynman loop 
diagram is shown in Figure 30 that describes the transition of a b quark into a charged 
-1/3 s or d quark, which is effectively a neutral current transition. The dominant charged 
current decays change the b quark into a charged +2/3 quark, either c or u. 


Ww 
b sd 


Figure 30. Loop or "penguin" diagram for a b — s or b — d transition. 


The intermediate quark inside the loop can be any charge +2/3 quark. The relative 
size of the different contributions arises from different quark masses and CKM elements. 
In terms of the Cabibbo angle (A=0.22), we have for the b — s case t:c:u £z A?:d?:\4. The 
mass dependence favours the t loop, but the amplitude for c processes can be quite large 
223096. Moreover, as pointed out by Bander, Silverman and Soni (1979), interference can 
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occur between £, c and u diagrams and lead to CP violation. In the Standard Model it 
is not expected to occur for b — s, due to the lack of a CKM phase difference, but could 
occur for b — d. It is always worth looking for such CP violation effects; all that needs to 
be done, for example, is to compare the number of B^ — K*-y events with the number 
of Bt + K**y events. 

There are other ways for physics beyond the Standard Model to appear. For example, 
the W- in the loop can be replaced by some other charged object such as a Higgs; it is 
also possible for a new object to replace the t. 


5.2 Standard Model theory 


In the Standard Model the effective Hamiltonian for the intermediate £ quark is given by 
(Deshpande 1994) 


4G 
Ha = Ug VeVe » Clu (42) 


Two of the more important operators are 


O= iyi d. Or= — SO wb F” (43) 


167 


The matrix elements are evaluated at the scale uj = My and then evolved to the b mass 
scale using renormalisation group equations, which mix the operators: 


Cilu) = $ Vis (p, Mw)C;(Mw). (44) 


5.3 b sy 


This process occurs when any of the charged particles in Figure 30 emits a photon. The 
only operator which enters into the calculation is C?(u). We have for the inclusive decay 


4G 
Ha = p (VisViel” Cz(mi)O: (45) 
Os = 167 — MBb” (46) 
G2.am? 2 
T'(b — 87) au dcn IA (47) 


It is far more difficult to calculate the exclusive radiative decay rates, but they are much 
easier to measure, Note that the reaction B — Ky would violate angular momentum 
conservation, so the simplest exclusive final state is B — K*+. 


Different techniques are used for reconstructing exclusive and inclusive decays and 
unique methods are invoked for exclusive decays on the Y(4S). At other machines the 
decay products, i, from an exclusive B decays. are used to reconstruct an “invariant mass” 
via M? = Y E? — Y; p?. At the T(4S) the decay products are first tested to see if the 
sum of their energies is close to the beam energy, Ey. If this is true then the “beam 
constrained" invariant mass is calculated as M? = E2,,, — Y; p?. These methods can 
be used for all exclusive B decays. Figure 31 shows the BELLE data for the reaction 
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Figure 31. BELLE data for the reaction B > pra. (a) The correlation between AE 
and M. The box shows the signal region. (b) The projection in AE for events in the M 
signal region; the line shows a fit to the background. (c) The M distribution for AE in 
the signal region; the line shows a fit to the background. 


B? + D*tn-, where the D*+ + 1+D°. BELLE first selects events with candidate D°’s. 
Then they require an additional x* where the measured mass difference between the D°x+ 
minus the D? candidate is consistent with the known mass difference. Selecting the D** 
candidates they combine them with candidate v^. Figure 31(a) shows the the correlation 
between the difference in measured energy AE = Y;; E; — Ebeam and the beam-constrained 
invariant, mass. In Figure 31(b) AE is shown after selecting the signal region in M, and 
in Figure 31(c) M is shown after selecting on AE. These plots show how clean signals 
can be selected. 


CLEO first measured the exclusive rate into K *(892)y (Ammar 1993) shown in Fig- 
ure 32. Here several different decay modes of the K*(892) are used. The current world 
average value for B(B — K*y) = (4.2 + 0.8 + 0.6) x 10-5. 


Events/ 3MeV 
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Figure 32. First published CLEO data for the reaction B — K*y showing the M 
distribution for AE in the signal region. 


To find inclusive decays two techniques are used. The first one, which provides the 
cleanest signals, is to sum the exclusive decays for the final states Ky, where n < 4 and 
only one of the pions is a 7°. These requirements are necessary or the backgrounds become 
extremely large. Both charged and neutral kaons are used. Of course, imposing these 
restrictions leads to a model dependence of the result that must be carefully evaluated. 
This is why having an independent technique is useful. That is provided by detecting only 
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Figure 33. Levels of inclusive photons from various background processes at the Y (AS) 
labeled largest to smallest at 2.5GeV/c. Also shown is the expected signal from b > sy. 


the high energy photon. The technique used is to form a neutral network to discriminate 
between continuum and Y(4S) data using shape variables. 


The momentum spectrum of the y peaks close to its maximum value at half the B 
mass. If we had data with only B mesons, it would be easy to pick out b > sy. We have, 
however, a large background from other processes. At the T(45), the y spectrum from 
the different background processes is shown. The largest is 1? production from continuum 
ete” collisions, but another large source is initial state radiation (ISR), where one of the 
beam electrons radiates a hard photon before annihilation. The backgrounds and the 
expected signal are illustrated in Figure 33. Similar backgrounds exist at LEP. 


signal 


Continvum Continuum + LSR 
A 


Figure 34. Examples of idealised event shapes. The straight lines indicate hadrons and 
the wavy lines photons. 


CLEO also made the first measurement of the inclusive rate for b — sy (Alam 1995). 
To remove background CLEO used two techniques, one based on "event shapes" and the 
other on summing exclusively reconstructed B samples. Examples of idealised events are 
shown in Figure 34. CLEO uses eight different shape variables and defines a variable r 
using a neural network to distinguish signal from background. The idea of the B re- 
construction analysis is to find the inclusive branching ratio by summing over exclusive 
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modes. The allowed hadronic system is comprised of either a K, — ntr” candidate or 
a K* combined with 1-4 pions, only one of which can be neutral. The restriction on 
the number and kind of pions maximises efficiency while minimising background. It does 
however lead to a model dependent error. For all combinations CLEO evaluates 


Mp — 5.279\? (Eg — Ewan V. 
x&-( B ) «( B bea E (48) 
OM OE 


where Mg is the measured B mass for that hypothesis and Eg is its energy. X2, is 
required to be « 20. If any particular event has more than one hypothesis, the solution 
which minimises X2, is chosen. For events with a reconstructed B candidate CLEO also 
considers the angle between the direction of the B and the thrust axis of event with the B 
candidate removed, cos(@,). This is highly effective in removing continuum background. 


A neural network is used to combine r, X2, cos(6,) into a new variable r, and events 
are then weighted according to their value of re. This method maximises the statistical 
potential of the data. Figure 35 shows the photon energy spectrum of the inclusive signal 
from CLEO combining both reconstruction techniques. The signal is compared to a 
theoretical prediction based on the model of Ali and Greub (Ali 1991). A fit to the model 
over the photon energy range from 2.0 to 2.7GeV /c gives the branching ratio result shown 
in Table 5, where the first error is statistical, the second systematic and the third shows 
the dependence on the theoretical model (Chen 2001). 
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Figure 35. The background subtracted photon energy spectrum from CLEO. The spectrum 
is not corrected for resolution or efficiency. The solid lines show the spectrum from a 
simulation of the Ali-Greub spectator model with the b quark mass set to 4.690GeV and 
the Fermi momentum set to 410MeV/c. 


ALEPH reduces the backgrounds by weighting candidate decay tracks in a b > sy 
event by a combination of their momentum, impact parameter with respect to the main 
vertex and rapidity with respect to the bhadron direction (Barate 1998). 


Current results are shown in Table 5. The data are in agreement with the Standard 
Model theoretical prediction to next to leading order, including quark mass effects of 
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Experiment B x 1074 
CLEO 3.21 + 0.43 + 0.27*018 
ALEPH 3.11 + 0.80 + 0.72 
BELLE 3.36 + 0.53 + 0.447030 
Average 3.23 + 0.42 


Table 5. B(b > s»). 


(3.73 + 0.30) x 107* (Hurth 2001). A deviation here would show physics beyond the 
Standard Model. More precise data and better theory are needed to further limit the 
parameter space of new physics models, or show an effect. 


5.3.1 |Væ| using moments of the photon energy spectrum 


In Section(2.2.4) we found a value of Va using the first and second hadronic mass moments. 
Here we use the first moment of the photon energy distribution in b — sy. The values 
found for the moments and A which is directly proportional to (E,) are (Chen 2001) 


(Ej) = 2.346 + 0.032 + 0.011 GeV (49) 
(E2) - (Ey)? = 0.0226 + 0.0066 + 0.0020 GeV? (50) 
A = 0.35+0.08 + 0.10 GeV. (51) 


In Figure 36 we show the combination of first moments from photon energy in b — sy 
and hadron moments in b — cév. This implies a value of Va around 0.0406. 


Figure 36. Correlation between ,, A and Va, derived from (E) and (M2. — Mh). The 
lighter bounds include both experimental and theoretical errors. 
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5.4 Rare hadronic decays 


The decays B? > ntr- and B? > K~x* do not contain any charm quarks in the final 
state, so they must proceed either via the tree level V, process shown in Figure 37(left), 
or via the penguin process shown in Figure 37(right). 
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Figure 37. Decay diagrams for B^ ntr. (left) Via tree level Vj process. (right) 
Via a penguin process. 


These diagrams can interfere with each other, and they can also interfere through B° 
mixing, thus complicating any weak phase extraction. The same diagrams are applicable 
for B > K-t by replacing W^ — wud in the tree level diagram by W- — us and 
replacing the £d coupling in the penguin by a ts coupling. 
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Figure 38. Diagrams for B^ — K-n’, (a) and (b) are tree level diagrams where (b) 
is color suppressed; (c) is a penguin diagram. (d) shows B^ — K°n~, which cannot be 
produced via a tree diagram. 


Other diagrams for producing Kr final states are shown in Figure 38. In Section 7.1 
it will be shown that CP violation can result from the interference between two distinct 
decay amplitudes leading to the same final state. Consider the possibility of observing 
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Figure 39. Signals in M and AE for two-body decay modes from BELLE. The data 
result from projections of a Likelihood fit that takes into account event shape and particle 
identification information. The dashed lines are the signal projections. The dotted lines 
in the AE distributions are projections of the background component from the x & K 
substitution. 


CP violation by measuring a rate difference between B7 — K^ 1? and Bt + K*«?. The 
K n? final state can be reached either by tree or penguin diagrams. The tree diagram has 
an imaginary part coming from the V,, coupling, while the penguin term does not, thus 
ensuring a weak phase difference. This type of CP violation is called “direct.” Note also 
that the process B +K°x~ can only be produced by the penguin diagram in Figure 38(d). 
Therefore, we do not expect a rate difference between B- —nK*x- and B* K?s*. 


Measurements of these rates have been made by several groups. Recent data from 
BELLE are shown in Figure 39 (Abe 2001a). Table 6 lists the currently measured branch- 
ing ratios. 


Mode CLEO BABAR BELLE Average 
nta  ATSX06 414100.7 56:22:04 — 4.5793 

ntn? <12 «9.6 «13.4 

K*z* 188'28 +41.3 16716 E13 193732719 17.751 
Ktn? 121733275 — 108723016 — 16373518 12.1417 


K?r- 1827160116 182733520 — 13.750713  17.3:21 


Kn? — 148192724 — 82731012 — 16.0722:25 10.4727 


Table 6. Branching Ratios for B + Kx and B — s in units of 1075. 
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6 Hadronic decays 


6.1 Introduction 


Mark Wise in his talk at the 2001 Lepton Photon conference (Wise 2001) gave some advice 
to theorists: “If you drink the nonleptonic your physics career will be ruined and you will 
end up face down and in the gutter.” Presumably Mark’s statement was inspired by the 
difficulty in predicting hadronic decays. Here we have lots of gluon exchange with low 
energy gluons, while perturbation theory works well when the energies are large compared 
with Agcp ~200MeV. Furthermore, multibody decays are currently impossible to predict, 
so we will consider only two-body decays. 


6.2 Two-body decays into a charm or charmonium 


We start by considering two-body decays into a charmed hadron (Neubert 1997). Figure 6 
shows the processes for both B- and B? decays into a D and a 1^. There is only one 
possible process for the B, the simple spectator process (left), while the color suppressed 
spectator (right) is also allowed for the B7. We call decays with only the simple spectator 
diagram allowed "class I" and decays where both the simple and color suppressed diagrams 
are allowed "class HII". Note, that because the colors of all the outgoing quarks must be 
the same in the color suppressed case, naively the amplitude is only ~1/3 that of the 
simple spectator case where the W- can transform into quarks of all three colors. “Class 
II" decays are processes than can only be reached by the color suppressed spectator 
diagram, for example the B? — J/VK, decay shown in Figure 40. 


a 
b z} A 
‘—<w À S 
af Ks 
Figure 40. Color suppressed spectator decay diagram for B® — J/WK,. 


The effective Hamiltonian consists of local 4-quark operators renormalised at the scale 
u and the Wilson coefficients from the Operator Product Expansion, c;(u). We have 


Ha = FE [Va [ewe + uQ] + Va} (52) 
? = [uva (8c)v—a] @)v—a 


Q? (@u)v_a(db)y_a + (cc)v -A(b)v A, 


H 


where the notation (Giq@)v-a = Qrv,(1 — ys)q2. Without QCD corrections cj(u) = 1 
and c;(u) = 0. From the non-leading order correction using the renormalisation group 
equations, we have ci (u) = 1.132 and cs(u) = —0.249. 
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We can factorise the amplitude by assuming that the current producing the 7^ is 
independent of the one producing the charmed hadron. Let us consider a class I case, 
B? + D*z-. The amplitude can be written as 


G 4 
Ano, = — Va Vm (77 |(du) 4] 0) (D* |(@b)v| B^) (53) 
The part of the amplitude dealing with the 7^ is known from pion decay. We have 
(x7 |dy,7su|0) = if,py, where the axial vector structure is made explicit, p, is the pion 
four-vector and f, is given by measuring the decay width for 1^ — p v. The term a, is 
defined as 

a, = e(ug) + &o(ur), (54) 


where £ is equal to the number of colors and the scale p; is on the order of the b quark 
mass. Then 


G 
Afact = -VeV fs (ri = mÈ )EP ^? (m). (55) 


The Fo form factor can either be calculated or measured in semileptonic decays. 


For a class II process B > J/wK: 


Ana = -BVesVesaa(I | |(@e)v| 0)(K [30v] B), (56) 


where as = co(uj) + &ci (ps). 

A class III decay has a term a, + za» in the amplitude, where x equals one from flavor 
symmetry. The actual values of a, x, and az are not well predicted from theory, but we 
can obtain them from the data. 

One method is to use the class I decays to obtain a; = 1.08 + 0.04. It is possible to 
calculate ag/a, as shown in Figure 41 (Neubert 1997). Using these values the measured 
branching ratios are compared with the predicted ones in Table 7. Here x = +1 is used, 
taken from the data. This is opposite to the interference in D decays but is expected from 
the calculation shown in Figure 41. 


0 0.2 0.4 0.6 0.8 1 
RA 


Figure 41. Calculated values of az/a; versus œs. The arrows indicate the ratios chosen 
for B and D decays. 
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Class I Class II Class III 
Mode Model Data Mode Model Data Mode Model Data 
Dtr- 30 3044  J/vK- 11 1041 D°x- 48 5345 
D**«- 30 2842  J[vK* 17 1542  D'?g- 49 4643-4 
D*p- 70 79+14 Dg? 0.7 2.8+0.4 Dp- 110 134418 
D*+ p 85 73+15 Dr? 1.0 2.140.9 D'?p- 119 15531 


Table 7. Predicted Branching Ratios (1074) (Neubert 1997) Compared to Measurement 
For Two-Body Hadronic Decays. 


The agreement is rather good except for the newly measured D°‘*)7° modes where it 
is rather miserable. CLEO and BELLE both have rates for D^z? of (2.7 + 0.3 + 0.5) x 
1074 and (2.9*53 + 0.6) x 1075, respectively, while CLEO alone has measured D*?z? as 
(2.1 30.5 + 0.8) x 1074. 


6.2.1 Isospin analysis of the B — D system 


All the decay rates for B — Dr have now been measured. The four-quark operator 
(du)(cb) has isospin I=1 and I;=+1. It transforms the B into final states D*z- and 
D°r° with I=1/2 or I-3/2. The B- decays into D?z- with I=3/2 only. It is thought that 
the isospin amplitudes cannot be modified by final state interactions, so we can look for 
evidence of final state phase shifts by doing an “isospin analysis.” The decay amplitudes 
form a triangle as shown in Figure 42. 


2 Ace AS 


+- 
A 
Figure 42. The isospin amplitude triangle for B — Dn- decays. 


The relationship among B decay amplitudes and isospin amplitudes is given by 


1 2 
AB? > Dir) = Jisan + Jian 
2 1 
ase — Jian (57) 


A(B- => D?sz ) = V3A3/2. 


A(B° — D^) 


These equations may be solved for the isospin amplitudes and the relative phase shift 


between the two amplitudes. The solution is 
2 ] 2 
lus = AG? > D+)? + |A’ > D°)? - 3B > Dal (58) 
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lupf = iau > Df 
3|A(B- > D°x-)|? - 2 |Aus| — [ayo] 
2/2 Aiyo| |Asy»| 


cosó = cos (63/2 — 52) = 


Solving these equations for the Dz final states gives cos ô = 0.88+0.05 which indicates 
a phase shift of about 28 + 8 degrees, which is not statistically significant enough to rule 
out zero. 


6.2.2 Factorisation tests using semileptonic decays 


The factorised amplitude for Dr~ decays in Equation 56 is the product of two hadronic 
currents, one for W- — «- and the other for B — D. In semileptonic decay (Figure 5) 
we have the product of the known lepton current and the pion current. At q? = m? the 
B — D should be the same in both decays, at least for class I. The comparison for the 
general case of any hadron A^ is 


T (B > DOR) = 6z?ai fz Vaal? " (B > Dc) (59) 


gam; 


Tests of this equation for D*+ and a 1^, p^ or aj are satisfied at about 15% accuracy 
(Bortoletto 1990) (Browder 1996). 


Another test compares the polarisation of the D* in both hadronic and semileptonic 


cases: LL (B EN D*h) = = (B > D*h) ; (60) 


52 
Pam, 


where Tz denotes the longitudinally polarised fraction of the decay width. Comparisons 
with data will be shown in the next section. 

There are more modern approaches to factorisation (Beneke 2001, and Bauer 2001). 
However these approaches predict 


[(B- > D°x-) 


PP aps |*00sm) (D 


which seems to contradict current observations. 


6.3 Observation of the p' in B decays 


CLEO made the first statistically significant observations of six hadronic B decays shown 
in Table 8 that result from studying the reactions B + D®)x+a-2~7°(Alexander 2001). 
The signal in one of these final states B^ — D(*?*z-52725, where D*+ > 1* D? and 
D? — K~7x* is shown in Figure 43. 


In examining the substructure of the four-pions, a clear w signal was observed in the 
ntra’ mass as can be seen in Figure 44, leading to a significant amount of D®wa-. 
Furthermore, there is a low-mass resonant substructure in the wz^ mass (Figure 45). 
The spin and parity of the wx” resonance (denoted by A temporarily) is determined by 
considering the decay sequence B A D; A-»uwmx and w m*w «?. The angular 
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Mode B (96) 

B —> D*+ntn on n? 1.7240.14+0.24 
B? > Dttwa- 0.29+0.03+0.04 
B? > Dtwr- 0.28+0.05+0.03 
B- > D'm'z-n n) 1.80+0.24+0.25 
B- > Dwr 0.45+0.10+0.07 
B- > D*ua- 0.41+0.07+0.04 


Table 8. Measured Branching Ratios 


Events / 2 MeV 


Figure 43. The B candidate mass spectra for the final state D** n*n-1- 19, with D? > 
Krt (a) for AE sidebands and (b) for AE consistent with zero. The curve in (a) is 
a fit to the background distribution described in the text, while in (b) the shape from (a) 


is used with the normalisation allowed to float and a signal Gaussian of width 2.7MeV is 
added. 


$ 50 € Signal 

o — Ma Sideband 
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t 
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Figure 44. The invariant mass spectra of nnn? for the final state D** nnan n?’ for 
all three D? decay modes K^n*, K^n*n^n- and K-m*m?. The solid histogram is the 
background estimate from the Mg lower sideband and the dashed histogram is from the 
AE sidebands; both are normalised to the fitted number of background events. There is 
an additional cut selecting the centre of the Dalitz plot. 
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Figure 45. (left) The background subtracted efficiency corrected wn mass spectrum from 
Bo (D*+ + D? + D*)wn- decays fit to a Breit- Wigner shape. 


distributions are shown in Figure 46. Here 04 is the angle between the w direction in the 
A rest frame and the A direction in the B rest frame; 0,, is the orientation of the w decay 
plane in the w rest frame; and X is the angle between the A and w decay planes. 


0 
cos 0. 


Figure 46. The angular distribution of 04 (top-left),9,, (top-right) and X (bottom). The 
curves show the best fits to the data for different J? assignments. The 07 and 1* are 
almost indistinguishable in cos04, while the 17 and 2* are indistinguishable in cosb, 
and X. The vertical axis gives efficiency corrected events (104 data events are used). 


The data are fit to the expectations for the various J^ assignments. The w polarisation 
is very clearly transverse (sin?6,) and that infers a 17 or 2+ assignment. The cos 64 
distribution prefers 17, as does the fit to all three projections. 


This structure is identified with the p’ because it has the correct J^ and is at approx- 
imately the right mass. To determine the mass and width parameters, which are not well 
known, we write the decay width as a function of wa7 mass as 


1 
dT(B- Dun”) = a; |A(B > Dp): BW mur) - A(p > wr”) (62) 
B 


x dP(B > Dg): dP(p ^ un) ee, 
7 
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where dP indicates phase space and the Breit-Wigner amplitude is given by 


1 
(m2, m m?) = Vmus tot (Mun) ' 


BW (Maor) = 


(63) 


The Breit-Wigner fit assuming a single resonance and no background gives a mass of 
1349:-25* MeV with an intrinsic width of 547+8674®MeV. The fit shows that the wz- 
mass spectrum is consistent with being entirely one resonance. This state is likely to be 
the elusive p' resonance (Clegg 1994). These are by far the most accurate and least model 
dependent measurements of the p' parameters. Since the p' dominates the final state, the 
branching ratios for D(?)un- apply also to D(?p'-. 


Heavy quark symmetry predicts equal partial widths for D*p' andDp'. CLEO mea- 
sures the relative rates to be (B > D*p'-)/T (B > Dp") = 1.06+0.17+0.04, consistent 
within the relatively large errors. 


Factorisation predicts that the fraction of longitudinal polarisation of the D** is the 
same as in the related semileptonic decay B — D*£^v at four-momentum transfer q? 
equal to the mass-squared of the p' 


T: (B > D'*g-) T,(B D't-v) 


T(B2D**g-)  T(Bo DED) | (64) 


CLEO's measurement of the D** polarisation is (6349)%. The model predictions in 
semileptonic decays for a q? of 2GeV?, are between 66.9 and 72.696 (Isgur 1990, Wirbel 
1985, Neubert 1991). Figure 47 shows the measured polarisations for the D**p'-, the 
D** p7, (Artuso 1999) and the D** D*~ final states (Stone 2000). The latter is based on 
a new measurement using partial reconstruction of the D** (Ahmed 2000). Thus this 
prediction of factorisation is satisfied. 


C] Factorization Prediction (10) Region 
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Figure 47. Measured D** polarisation versus semileptonic model predictions. 
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7 CP violation 


7.1 Introduction 


Recall that the operation of Charge Conjugation (C) takes particle to anti-particle and 
Parity (P) takes a vector r to —r. CP violation can occur because of the imaginary term 
in the CKM matrix, proportional to 7 in the Wolfenstein representation (Bigi 2000). 


Consider the case of a process B — f that goes via two amplitudes A and B each of 
which has a strong phase, s4, and a weak phase, wa. Then we have 


M(B f) = (lAleleatea) 4 |B] eonim? (65) 
M(B >F) = (Aeae 4 [B| eos? (66) 
T(B— f)-I(B> f) = 2|AB|sin(s4 — sg) sin(w4 — wg). (67) 


Any two amplitudes will do, though it is better that they be of approximately equal 
size. Thus charged B decays can exhibit CP violation as well as neutral B decays. In 
some cases, we will see that it is possible to guarantee that [sin(s4 — sg)| is unity, so we 
can get information on the weak phases. In the case of neutral B decays, mixing can be 
the second amplitude. 


7.2 Unitarity triangles 


The unitarity of the CKM matrix, namely 


Vaa Vas Vab Va Và Va 100 
Vea Ves Vo || Vs Vs Vie | =]0 1 Of, 
Via Vis Vo} VV Va Vg 001 


allows us to construct six relationships of which the most useful turns out to be 
VaaVo + Vas V + Vasa, = 0. (68) 
To a good approximation 
Vaa 82 Vg zl, Va = —Va, Vis =A, (69) 


so that Equation 68 becomes 


Vä 1 Vb 
—— — ——- =Q. 0 
XVa t XV (70) 
Now 
1 | Via 1 2 
= 1 2 72 
XV. 3Vo-0D +n (72) 
1 [Vus lr 
ee M 73 
NEA VP tm, (73) 
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Figure 48. The CKM unitarity triangle in the p- plane. The left side is determined by 
measurements of b > u/b > c and the right side can be determined using mixing mea- 
surements in the B, and Bg systems. The angles can be found by making measurements 
of CP violating asymmetries in hadronic B decays. 


so that Equation 70 defines a triangle in the p-7 plane. This is the CKM triangle depicted 
in Figure 48. We know the length of two sides already: the base is defined as unity and 
the left side is determined by the measurements of |Vus/Væl, but the error is still quite 
substantial. The right side can be determined using mixing measurements in the neutral 
B systems. Figure 48 also shows the angles labelled o, £, and y. These angles can be 
determined by measuring CP violation in the B system. 


Another constraint on p and 7 is given by the measurement of e in CP violation in K? 
decays (Buras 1995): 


B 
n (1 — p)A?(1.4 + 0.2) + 0.35] ATE = (0.30 + 0.06), (74) 


where By is parameter that cannot be measured and thus must be calculated. A reason- 
able range is 0.6 < By < 0.9, given by an assortment of theoretical calculations (Buras 
1995). This number is one of the largest sources of 


uncertainty on the e constraint. Other constraints come from current measurements 
on Vi»/Veo, Ba mixing and B, mixing. The current status of these constraints on p and 
n is shown in Figure 49 (Hocker 2001). The width of these bands are also dominated by 
theoretical errors. This shows that the data are consistent with the standard model. 


It is crucial to check if measurements of the sides and angles are consistent, i.e., whether 
or not they actually form a triangle. If they do not the standard model is incomplete. 
The standard model has many parameters including the four CKM numbers, six quark 
masses, gauge boson masses and coupling constants. Perhaps measurements of the angles 
and sides of the unitarity triangle will bring us beyond the standard model and show us 
how these parameters are related, or what is missing. 


New physics can also be observed by measuring branching ratios which violate standard 
model predictions. Especially important are rare decay processes such as B > Kutu 
or D > «vu* u^. These processes occur only through loops, and are an important class of 
penguin decays. 
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Figure 49. The regions in p—7j space (shaded), where p = p(1—2?/2) andy = n(1—A?/2), 
consistent with measurements of CP violation in K? decay (c), Va/Væ in semileptonic B 
decay, BY mizing, and the excluded region from limits on B? mizing. The allowed region 
is defined by a fit from (Hocker 2001). The large width of the By mixing band is dominated 
by the uncertainty in Bp fZ. The lines that are not specified are at 5% confidence level. 


7.3 Formalism of CP violation in neutral B decays 


Consider the operations of Charge Conjugation, C, and Parity, P: 


C|B(p) = [B(p)), C|B(p) = |B(p) 
P|B(p) = -|B(-p)), P|B(p) = -IB(-p) 
CP|B(p) = -IB(-p)), CP|B(p) = -|B(-p). 


For neutral mesons we can construct the CP eigenstates 


0 1 0 p 
IB) = z (2° - 1B )), 
0 1 0 p 
|B) = Z (1B°) + [B)), 


where 


CP|B)) = |B); 
CP|B)) = —|B3). 


Since B? and B can mix, the mass eigenstates are a superposition of a| B°) + b[B^) which 


obey the Schrodinger equation 


(i) -*()- G49 (7) 


(82) 
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If CP is not conserved then the eigenvectors, the mass eigenstates |B,) and | By), are not 
the CP eigenstates but are 


Lu =0 
|By) = p|B°) + a[B ), |Bu) = »|B^) — |B), (83) 
where 
p- 1 lte -1 ke (84) 
v2 A leal? v2 JA 4+ lea? 
CP is violated if eg # 0, which occurs if |g/p| Z 1. 
'The time dependence of the mass eigenstates is 
|B,(t) = eT Pene, (0) (85) 
|Bu(t)) = e "Permet? By (0)), (86) 
leading to the time evolution of the flavor eigenstates as 
; Amt Amt 
|B(t)) = e (met /2)t (s —-1B°(0)) + i sin > oy) (87) 
; Amt Amt 
[B'()) = emm (z sin 7 |B^(0)) + cos 77 io») (88) 


where m = (m; +my)/2, Am = my — m, T =T; Tu, and t is the decay time in 
the B? rest frame, the so called proper time. Note that the probability of a B^ decay as a 
function of t is given by (B®(t)|B°(t))*, and is a pure exponential, e '"?, in the absence 
of CP violation. 


7.3.1 CP violation for B via interference of mixing and decays 


Here we choose a final state f which is accessible to both B? and p decays (Bigi 2000). 
'The second amplitude necessary for interference is provided by mixing. Figure 50 shows 
the decay into f either directly or indirectly via mixing. It is necessary only that f be 


Lm N 
es B? reet 


Figure 50. Two interfering ways for a B" to decay into a final state f. 


accessible directly from either state; however if f is a CP eigenstate the situation is far 
simpler. For CP eigenstates 
CP|fcp) = | fcp). (89) 


It is useful to define the amplitudes 


A = (fee B^), A = (fce HB’). (90) 
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If [A/A| # 1, then we have "direct" CP violation in the decay amplitude, which we will 
discuss in detail later. In general CP can be violated by having 


*1 (91) 


which is also the case if À acquires a non-zero phase, i.e |A| could be unity and CP violation 
can occur. 


A comment on neutral B production at ete” colliders is in order. At the Y(4S) 
resonance there is coherent production of B°B® pairs. This puts the B's ina C = —1 
state. In hadron colliders, or at e* e^ machines operating at the Z^, the B's are produced 
incoherently. For the rest of this article I will assume incoherent production except where 
explicitly noted. 


The asymmetry, in this case, is defined as 


Afop = (92) 
T T(B(t) 2 fce) +T (B'(t) > fc») 
which for |g/p| = 1 gives 
_ (1 — JAP) cos (Amt) — 21mA sin(Amt) 
afcr — 1 n DE M (93) 
For the cases where there is only one decay amplitude A, |A| equals 1, and we have 
Gg, = —ImA sin(Amt). (94) 


Only the amplitude, —ImA, contains information about the level of CP violation, the sine 
term is determined only by B4 mixing. In fact, the time integrated asymmetry is given 
by 


HA 
atop = 7 ilm A = —0.481m A. (95) 


This is quite lucky as the maximum size of the coefficient for any x is —0.5. 


Let us now find out how Im A relates to the CKM parameters. Recall 4 = (q/p)(A/A). 
The first term is the part that comes from mixing: 


2 
* + 
n (V Via) _ (1— p - ig? LL QUAE 
Po |VaVeal? — (0-03) (1 — p — in) 


(96) 


and 


Im 2 = 2d pin = sin(28). (97) 
p è ü-prem 
To evaluate the decay part we need to consider specific final states. For example, 
consider f = «^«^. The simple spectator decay diagram is shown in Figure 37 (left). For 
the moment we will assume that this is the only diagram which contributes, although we 
know this is not true. For this b + uüd process we have 


(vv. ) _ in)? . 
ud ub E (p in) — e? (98) 
ẹVaa Væl — (o— imo im) ' 


A 
= 
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and 
Im (A) = Im (e ?2e-??) = Im (e?) = — sin(2a). (99) 


For our next example let us consider the final state J/1)K,. The decay diagram is 
shown in Figure 40. In this case we do not get a phase from the decay part because 


(vavz)' 


100 
[Va Val (100) 


LA 
= 


is real to order 1/A*. 


In this case the final state is a state of negative CP, i.e. CP|J/9 K,) = —|J/vK,). 
This introduces an additional minus sign in the result for Im A. Before finishing discussion 
of this final state we need to consider in more detail the presence of the A, in the final 
state. Since neutral kaons can mix, we pick up another mixing phase (similar diagrams 
as for B°, see Figure 26). This term creates a phase given by 


2 
Vai Ves 
() = ive) (101) 
p K [Vea Ves] 


which is real to order A*. It necessary to include this term, however, since there are other 
formulations of the CKM matrix than Wolfenstein, which have the phase in a different 
location. It is important that the physics predictions not depend on the CKM convention. 
(Here we don't include CP violation in the neutral kaon since it is much smaller than what 
is expected in the B decay. The term of order A* in V;, is necessary to explain K^ CP 
violation.) 


In summary, for the case of f = J/ K,, Im A = — sin(2f)). 


7.3.2 Comment on the penguin amplitude 


In principle all processes can have penguin components. One such diagram is shown in 
Figure 37(right). The 7*7 final state is expected to have a rather large penguin ampli- 
tude ~10% of the tree amplitude. Then |A| 4 1 and a44(t) develops a cos(Amt) term. It 
turns out that sin(2o) can be extracted using isospin considerations and measurements 
of the branching ratios for B+ > «*«? and B? — sa, or other methods, the easiest of 
which appears to be the study of B9 — pn. 


In the J/wK, case, the penguin amplitude is expected to be small since a cc pair must 
be *popped" from the vacuum. Even if the penguin decay amplitude were of significant 
size, the decay phase is the same as the tree level process, namely zero. 


7.4 Results on sin 28 


For years observation of large CP violation in the B system was considered to be one 
of the corner stone predictions of the Standard Model. Yet it took a very long time to 
come up with definitive evidence. The first statistically significant measurements of CP 
violation in the B system were made recently by BABAR and BELLE. This enormous 
achievement was accomplished using an asymmetric e*e^ collider on the T(4S) which 
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Experiment sin 28 
BABAR 0.59 + 0.14 + 0.05 
BELLE 0.99 + 0.14 + 0.06 


Average 0.79+0.11 
CDF 0.79*51, 
ALEPH 0.841052 + 0.16 
OPAL 33*55 + 0.5 


Table 9. Statistically significant measurements of sin 20 compared with previous results. 


was first suggested by Pierre Oddone. The measurements are listed in Table 9, along with 
other previous indications (Groom 2001). 


The average value of 0.79 + 0.11 is taken from BABAR and BELLE only. These two 
measurements do differ by a sizeable amount, but the confidence level that they correctly 
represent the same value is 696. The value of 0.79 is consistent with what is expected 
from the other known constraints on p and 7. We have 


jaf PL (12) 


" sin 28 


There is a four fold ambiguity in the translation between sin 28 and the linear constraints 
in the p~7 plane. These occur at 8, */2— 8, v -- B and 37/2 — B. Two of these constraints 
are shown in Figure 51. The other two can be viewed by extending these to negative 7. 
We think 7 > 0 based only on the measurement of e in the neutral kaon system. 


1 


sin(2ß) 


Figure 51. Constraints from sin 20 measurement overlaid with other constraints (Hocker 
2001). The inner band is at 1o while the outer band, shown on one band only, is at 2c. 
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7.5 Remarks on global fits to CKM parameters 


The fits shown in this paper (Hocker 2001) for p and 7 have been done by others with 
a somewhat different statistical framework (Ciuchini 2001, Mele 1999). The latter group 
uses “Bayesian” statistics which means that they use a priori knowledge of the probability 
distribution functions. The former are termed "frequentists" (Groom 2001), almost for 
the lack of a better term. The frequentists are more conservative than the Bayesians. 


The crux of the issue is how to treat theoretically predicted parameters that are used 
to translate measurements into quantities such as Va or ey that form constraints in the 
p — 7 plane. This is of course a problem because it is difficult to estimate the uncertain- 
ties in the theoretical predictions. Both groups treat the experimental measurements as 
Gaussian distributions with the 1o width derived from combined statistical and system- 
atic errors. Note, that experimental systematic errors are also difficult to quantify and 
are not necessarily Gaussian, but they are judged to be sufficiently well known to not 
cause a problem. 


Hocker et al. (2001) have decided to use a method which restricts the theoretical 
quantities to a 9596 confidence interval where the parameter in question is just as likely 
to lie anywhere in the interval. They call this the “Rfit” method. Of course assigning 
the 9596 confidence interval is a matter of judgment which they fully admit. Ciuchini 
et al. (2001) treat the theoretical errors in the same manner as the experimental errors. 
They call theirs “the standard method" with just a bit of hubris. They argue that QCD 
is mature enough to trust its predictions, that they know the sign and rough magnitude 
of corrections and they can assign reasonable errors, so it would be wrong to throw away 
information. 


Hocker et al. point out an interesting but generally unknown danger with the Bayesian 
approach, which is that in multi-dimension problems the Bayesian treatment unfairly 
predicts a narrowing of possible results. The following discussion will demonstrate this. 


Let z; denote N theoretical parameters over the identical ranges [-A. +â]; then the 
theoretical prediction is 


N 
T =J] x. (103) 
In the 95% scan scheme (TO) = [-AP, - AN], while in the Bayesian approach the con- 
voluted Probability Density Function (PDF) is 
+00 too N (N) 
p(T) = f » f T] 42:G(a,)6(T - T£"), (104) 


where the G(z;) are the PDF's for the individual variables which taken to be equal here. 
This function has a singularity in p(T) that goes as (— In T)"^!, so it increases as N 
increases. 


Now suppose G(z;) is flat, then for N = 1 both approaches are the same, but for 
N > 2, the Bayesian approach gets a p(T) that peaks at zero. In effect, when the number 
of theoretical predictions entering the computation increases, and hence our knowledge of 
the corresponding observable decrease the Bayesian approach claims the converse. 


Lets look at a specific example where N — 3 and A — V3. Consider both the sum 
Ts = xı + 35 + z34 and product Tp = 112223 distributions. For Rfit the allowed ranges are 
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Figure 52. Convolution of the sum Ts = zı + £2 + z3 for both Rfit and Bayesian methods 
(left) and the product Tp = 212223 (right) of 3 parameters for the Bayesian method only. 
Plotted is the PDF p(T) obtained using for G(x) a uniform (solid lines, A = V3) or a 
Gaussian (dashed lines, a = 1) distribution. Both PDF’s of Tp have a singularity at the 
origin which is not shown. The Rfit ranges of Ts and Tp are indicated by the arrows 
located in both instances at +3\/3. From (Hocker 2001). 


identical being [—3A, 3A]. The left side of Figure 52 shows the probability density p(T) 
for Ts, while the right side shows p(T) for Tp with G(x) in the Bayesian case being either 
a Gaussian with c = 1 (solid lines) or a uniform distribution over the range [— V3, + V3] 
(dashed lines). The later distribution is closest to the Rfit method where the allowed 
range for either T is [-3v3, +33] indicated by the arrows. 


In the Rfit scheme the two predictions for Tp and Ts are identical, while in the Bayesian 
scheme there is a large difference in the PDF's and it really doesn't matter if a Gaussian or 
uniform distribution is chosen. There is a clear distinction between the Rfit and Bayesian 
predictions for Tp in this case, and the Bayesian one is unreasonable because it produces 
a very narrow PDF peaked very close to zero. 


An example of how this can effect the results is shown on Figure 53 where predictions 
of sin 28 from the indirect measurements are shown for the Rfit technique and either 
uniform or Gaussian Bayesian PDF's. The predictions are quite different. 


1.6 ~~ T T T T T To 700—997 T 
3 146 — Rfit 4242 sees Bayesian (Gauss) 
E, | 0 0 0 0 0 0 0 00 0 o mme Bayesian (uniform) 
m l 
£ 08 
E 0.6 
g 04 
IS 


0.2 


Figure 53. Comparison between Rfit (broad solid curve) and Bayesian fits for the indirect 
CKM constraints on sin 28. For the Bayesian fits: Gaussian (uniform) theoretical PDF's 
are depicted as dashed-dotted (dotted) curves. (This example does not include the newer 
data.) 
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8 New Physics studies 


8.1 Introduction 


There are many reasons why we believe that the Standard Model is incomplete and there 
must be physics beyond. One is the plethora of "fundamental parameters," for example 
quark masses and mixing angles. The Standard Model cannot explain the smallness of 
the weak scale compared to the GUT or Planck scales; this is often called "the hierarchy 
problem." In the Standard Model it is believed that the CKM source of CP violation 
extensively discussed here is not large enough to explain the baryon asymmetry of the 
Universe (Gavela 1993). Finally, gravity is not incorporated. John Ellis said “My personal 
interest in CP violation is driven by the search for physics beyond the Standard Model" 
(Ellis 2000). We can also take the view that we will discover additional large unexpected 
effects in b and/or c decays. 


We must realise that all our current measurements are a combination of Standard 
Model and New Physics; any proposed models must satisfy current constraints. Since 
the Standard Model tree level diagrams are probably large, let us consider them as a 
background to New Physics. Therefore loop diagrams and CP violation are the best 
places to see New Physics. The most important current constraints on New Physics 
models are 


» The neutron electric dipole moment, dy < 6.3 x 10-?79e-cm. 

e B(b > sy) = (3.23 + 0.42) x 107* and B(b > s£*£7) < 4.2 x 10 5. 
e CP violation in K; decay, ex = (2.271 + 0.017) x 107°. 

» B? mixing parameter Ama = (0.487 + 0.014) ps! 


8.2 Generic tests for new physics 


We can look for New Physics either in the context of specific models or more generically, 
looking for any deviations from the Standard Model expectation. One example is to 
examine the rare decays B > K£*£- and B — K*é*€ for branching ratios and polari- 
sations. According to Greub et al., “Especially the decay into K* yields a wealth of new 
information on the form of the new interactions since the Dalitz plot is sensitive to subtle 
interference effects" (Greub 1995). Another important tactic is to test for inconsistencies 
in Standard Model predictions independent of specific non-standard models. 


The unitarity of the CKM matrix allows us to construct six relationships. These may 
be thought of as triangles in the complex plane shown in Figure 54. (The bd triangle is 
the one depicted in Figure 48.) 


All six of these triangles can be constructed knowing four and only four independent 
angles (Silva 1997) (Aleksan 1994). These are defined as: 


- VV _ VibVua 
B — arg ( VaV* , y= arg VŽ Vad , (105) 


V* Va VV, 
x= Es 1 _ "ud "us . 
arg ( VV. ; X! = arg VEV., (106) 
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Figure 54. The six CKM triangles. The bold labels, e.g. ds refer to the rows or columns 
used in the unitarity relationship. The angles defined in equation (105) are also shown. 


(a can be used instead of y or 8.) Two of the phases 3 and y are probably large while X 
is estimated to be small 220.02, but measurable, while X' is likely to be much smaller. 


It has been pointed out by Silva and Wolfenstein (Silva 1997) that measuring only 
angles may not be sufficient to detect new physics. For example, suppose there is new 
physics that arises in B? — p mixing. Let us assign a phase to this new physics. If we 
then measure CP violation in B? > J/1K, and eliminate any penguin pollution problems 
in using B^ — ntr”, then we actually measure 26' = 28+ 0 and 20! = 2a — 0. So while 
there is new physics, we miss it, because 20' + 20" = 2a + 28 and a’ + 6’ + y = 180°. 


8.2.1 A critical check using X 


The angle X, defined in Equation 105, can be extracted by measuring the time dependent 
CP violating asymmetry in the reaction B, — J/wn"), or if one’s detector is incapable 
of high quality photon detection, the J/1/9 final state can be used. However, in this case 
there are two vector particles in the final state, making this a state of mixed CP, which 
requires a large statistics sample to do a time-dependent angular analysis to extract X. 


Measurements of the magnitudes of CKM matrix elements all come with theoretical 
errors. Some of these are hard to estimate. The best measured magnitude is that of 
A = |Vas/ Vaa| = 0.2205 + 0.0018. 

Silva and Wolfenstein (Silva 1997) (Aleksan 1994) show that the Standard Model can 
be checked in a profound manner by seeing if: 


Vis |? sin 8 siny 


Vaal sin(8 +7) 
Here the precision of the check will be limited initially by the measurement of sin X, not 
of A. This check can reveal new physics, even if other measurements have not shown any 
anomalies. 


sin X — 


(107) 


Other relationships to check include: 


. Vis |? sin sin(B + y) 
X — 
sin Va sin 8 , (108) 
2: . 
sinx = a| S02 sin(8 + 7) (109) 
ts Sin y 
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The astute reader will have noticed that these two equations lead to the non-trivial 
relationship: 
2 


sin? 8 = sin? y | 
cb 

This constrains these two magnitudes in terms of two of the angles. Note, that it is in 

principle possible to determine the magnitudes of |V,,/V4| and |Via/Vis| without model 

dependent errors by measuring D, y and X accurately. Alternatively, 8, y and A can be 

used to give a much more precise value than is possible at present with direct methods. 

For example, once @ and y are known 

Var? 2 sin? 8 

Val ^ sin'(8 y) 


Table 10 lists the most important physics quantities and the decay modes that can be 
used to measure them. The necessary detector capabilities include the ability to collect 
purely hadronic final states labeled here as “Hadron Trigger,” the ability to identify 
charged hadrons labeled as “Kx sep,” the ability to detect photons with good efficiency 
and resolution and excellent time resolution required to analyze rapid B, oscillations. 
Measurements of cos(26) can eliminate 2 of the 4 ambiguities in ¢ that are present when 
sin(2ó) is measured. 


2 
V, 


(110) 


(111) 


Physics | Decay Mode Hadron | Kr | y | Decay 
Quantity Trigger | sep det | time c 
sin(2a) B? 5 pr — ntron’ J Viv 

cos(2a) B? 5 pr — ntron’ vi Viv 
sign(sin(2a)) | B? > pr & B^ > ntr” i VE 

sin(9) B, > DEK* v |v V 
sin(5) B- => D'K- V Iv 

sin(y) B? > arta & B,  K*K- V V V 
sin(2X) B, > Jin, J/Yn Vi v 
sin(28) B? 5 JYK, 

cos(28) B? + Jh K9, K? > rly V 

cos(28) B? 5 JhyK*? & B, I/we 

Ls B, > Din J J 
AT for B, | B; > J/wn!, Diz-, K+K- V [VV] yv 


Table 10. Required CKM Measurements for b's 


8.2.2 Finding inconsistencies 


Another interesting way of viewing the physics was given by Peskin (2000). Non-Standard 
Model physics would show up as discrepancies among the values of (p, 7) derived from in- 
dependent determinations using CKM magnitudes (|Vuo/Veol and |Via/Vis|), or B9 mixing 
(8 and o), or B, mixing (X and ~y). 
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8.2.3 Required measurements involving 8 


Besides a more precise measurement of sin 20 we need to resolve the ambiguities. T'here 
are two suggestions on how this may be accomplished. Kayser (1997) showed that time 
dependent measurements of the final state J/YK?, where K? > rév, give a direct mea- 
surement of cos(20) and can also be used for CPT tests. Another suggestion is to use the 
final state J/V K*^, K*®? — K;1?, and to compare with B, > J/y¢ to extract the sign of 
the strong interaction phase shift assuming SU(3) symmetry, and thus determine cos(2{) 
(Dighe 1998). 


8.2.4 Required measurements involving a and + 


It is well known that sin(20) can be measured without problems caused by penguin 
processes using the reaction B? — J/wK;. The simplest reaction that can be used to 
measure sin(2a) is B? + m+”. This reaction can proceed via both the tree and penguin 
diagrams shown in Figure 37. 


Current measurements (Table 6) show a large penguin component. The ratio of the 
penguin amplitude to the tree amplitude in the *^7^ channel is about 1596 in magnitude. 
Thus the effect of the penguin must be determined in order to extract o. The only model 
independent way of doing this was suggested by Gronau and London, but requires the 
measurement of Bt > n*z? and B? — 7°n°, the latter being rather daunting. 


There is however, a theoretically clean method to determine a. The interference 
between tree and penguin diagrams is exploited by measuring the time dependent CP 
violating effects in the decays B? — px — 1+7~7° as shown by Snyder and Quinn (1993). 


The pr final state has many advantages. First of all, it has been seen with a relatively 
large rate. The branching ratio for the p"z* final state as measured by CLEO is (1.5+0.5+ 
0.4) x 1075, and the rate for the neutral B final state ptr? is (3.5*14 £0.5) x 1075, while 
the p? final state is limited at 90% confidence level to < 5.1 x 107$ (Gao 1999). BABAR 
(Bona 2001) measures B (B° — ptr?) as (28.9 + 5.4 + 4.3) x 1079. These measurements 
are consistent with some theoretical expectations (Ali 1999). The associated vector- 
pseudoscalar K*7 decay modes have conquerable or smaller branching ratios. Since the 
p is spin-1, the 7 spin-0 and the initial B is spinless, the p is fully polarised in the (1,0) 
configuration, so it decays as cos? 0, where @ is the angle of one of the p decay products 
with respect to the other 7 from the B, determined in the p rest frame. This causes the 
periphery of the Dalitz plot to be heavily populated, especially the corners. À sample 
Dalitz plot is shown in Figure 55. This kind of distribution is good for maximising the 
interferences, which helps minimise the error. Furthermore, little information is lost by 
excluding the Dalitz plot interior, a good way to reduce backgrounds. 


To estimate the required number of events Snyder and Quinn preformed an idealised 
analysis that showed that a background-free, flavor-tagged sample of 1000 to 2000 events 
was sufficient. The 1000 event sample usually yields good results for a, but sometimes does 
not resolve the ambiguity. With the 2000 event sample, however, they always succeeded. 


This technique not only finds sin(2o), it also determines cos(2a), thereby removing 
two of the remaining ambiguities. The final ambiguity can be removed using the CP 
asymmetry in B? — «^-^ and a theoretical assumption (Grossman 1997). 
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Figure 55. The Dalitz plot for B? + px — n*n- m? from Snyder and Quinn (1993). 


Several model dependent methods using the light two-body pseudoscalar decay rates 
have been suggested for measuring y. The basic idea in all these methods can be sum- 
marised as follows: the tree diagram for B? — mtr” has the weak decay phase y. In 
order to reproduce the observed suppression of the decay rate for ntn” relative to KtrF 
we require a large negative interference between the tree and penguin amplitudes. This 
puts y in the range of 90°. A great deal of theoretical work is required to understand the 
effects of re-scattering, form-factors etc. We are left with several ways of obtaining model 
dependent limits, due to Fleischer and Mannel (1998), Neubert and Rosner (1998), Fleis- 
cher and Buras (2000), and Beneke et al. (2001). The latter use a sophisticated model of 
QCD factorisation and apply corrections. Figure 56 shows values of y that can be found 
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Figure 56. Model predictions from (Beneke 2001) as a function of the indicated rate 
ratios. The dotted curve shows the predictions from naive factorisation. The curved 
bands show the total model uncertainties where the inner band comes from theoretical 
input uncertainties, while the outer band allows for errors to corrections on the theory. 
The specific sensitivity to |V| is showed as the dashed curves. The gray bands show the 
current data with a lo error while the lighter bands are at 2c. 
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Figure 57. (left) The two diagram diagrams for B, > D£K* that interfere via B, 
mizing. (right) The two interfering decay diagrams for B^ > D'K- where one is a 
b — u transition and the other a doubly- Cabibbo suppressed decay. 


in their framework, once better data are obtainable. 


In fact, it may be easier to measure y than o in a model independent manner. There 
have been two methods suggested. 


(1) Time dependent flavor tagged analysis of B, > DKF. This is a direct model 
independent measurement (Du 1986) (Aleksan 1992) (Aleksan 1995). Here the Cabibbo 
suppressed V,, decay interferes with a somewhat less suppressed Va decay via B, mixing 
as illustrated in Figure 57 (left). Even though we are not dealing with CP eigenstates 
here there are no hadronic uncertainties, though there are ambiguities. 


(2) Measure the rate differences between B^ > D'K- and Bt > D°K* in two 
different D? decay modes such as Krt and K+K. This method makes use of the 
interference between the tree and doubly-Cabibbo suppressed decays of the D?, and does 
not depend on any theoretical modeling (Atwood 1997, Gronau 1991). See Figure 57 
(right). 


8.3 New Physics tests in specific models 
8.3.1 Supersymmetry 


Supersymmetry is a kind of super-model. The basic idea is that for every fundamental 
fermion there is a companion boson and for every boson there is a companion fermion. 
There are many different implementations of couplings in this framework (Masiero 2000). 
In the most general case we pick up 80 new constants and 43 new phases. This is clearly 
too many to handle so we can try to see things in terms of simpler implementations. In 
the minimum model (MSSM) we have only two new fundamental phases. One 8p would 
arise in B? mixing and the other, 04, would appear in B° decay. A combination would 
generate CP violation in D? mixing, call it 6%, when the decay is D? — K-x+ (Nir 
1999). Table 11 shows the CP asymmetry in three different processes in the Standard 
Model and the MSSM. 


Two direct effects of New Physics are clear here. First of all, the difference in CP 
asymmetries between B° > J/YK, and B? — K, would show the phase $4. Secondly, 
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Process Standard Model New Physics 

B? > JhikK, sin 28 sin2(8 + 0p) 

B? > 6K, sin 28 sin2(8 -- 05 + 04) 
D? > Kort 0 ~ Sin $x« 


Table 11. CP Violating Asymmetries in the Standard Model and the MSSM. 


there would be finite CP violation in D — K~-a+ where none is expected in the Standard 
Model. 

Manifestations of specific SUSY models lead to different patterns. Table 12 shows the 
expectations for some of these models in terms of these variables and the neutron electric 
dipole moment dy; see (Nir 1999) for details. Note, that “Approximate CP" has already 


Model dy x 107% 0p 04 Sin kr 
Standard Model < 1075 0 0 0 
Approx. Universality — 7» 107? ©(0.2) O(1) 0 
Alignment 210? = O(0.2) O(1 O(1) 
Heavy squarks ~ 107! O(1) O(1 OoO(10?) 
Approx. CP ~ 107} -B 0  O(1073) 


Table 12. Some SUSY Predictions. 


been ruled out by the measurements of sin 28. 


In the context of the MSSM there will be significant contributions to B, mixing, 
and the CP asymmetry in the charged decay BF — óK*. The contribution to B, 
mixing significantly enhances the CP violating asymmetry in modes such as B, > J/wn. 
(Recall the CP asymmetry in this mode is proportional to sin 2X in the Standard Model.) 
The Standard Model and MSSM diagrams are shown in Figure 58. The expected CP 
asymmetry in the MSSM is ~ sind, cos $4 sin(Am;t), which is approximately 10 times 
the expected value in the Standard Model (Hinchliff 20012). 
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Figure 58. The Standard Model (left) and MSSM (right) contributions to B9 mizing. 


We observed that a difference between CP asymmetries in B? > J/1 K, and ¢K, arises 
in the MSSM due to a CP asymmetry in the decay phase. It is possible to observe this 
directly by looking for a CP asymmetry in BF > ¢ KF. The Standard Model and MSSM 
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Figure 59. The Standard Model (left) and MSSM (right) contributions to BX + óK-. 


diagrams are shown in Figure 59. Here the interference of the two diagrams provides the 
CP asymmetry. The predicted asymmetry is equal to (Mw/ Msquark) sin $,, in the MSSM, 
where Mesquark is the relevant squark mass (Hinchliff 20012). 


The óK and $K™ final states have been observed, first by CLEO (Briere 2001) and 
subsequently by BABAR (Aubert 2001). The BABAR data is shown in Figure 60. The 
average branching ratio is B(B^ — ¢K~-) = (6.8 + 1.3) x 1075 showing that in principle 
large samples can be acquired especially at hadronic machines. 


A 


0 


events / (2 MeV/c’) 


0 
5.20 5.25 5.30 5.20 5.25 5.30 
2 
Mgs (GeV/c) 


Figure 60. Projections of the maximum likelihood fit onto the B candidate mass mes 
from BABAR. (a) Bt — óK*; (b) B? 2 @K°®; (c) Bt —^ óK**; (d) Bo > óK**. In 
(c) the histogram is the sum of the two contributing K* channels while the shaded area. is 
K?n* alone (the other channel is Kt+n®). The solid line shows the projection of the signal 
plus background fit while the dashed line shows the projection of the background only. 
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8.3.2 Other new physics models 


There are many other specific models that predict New Physics in b decays. I list here a 
few of these with a woefully incomplete list of references, to give a flavor of what these 
models predict. 


Two Higgs and Multi-Higgs Doublet Models predict large effects in ey and CP vio- 
lation in D° + K-^7* with only a few percent effect in B? (Nir 1999). Expect to 
see 1-10% CP violating effects in b — sy (Wolfenstein 1994). 


Left- Right Symmetric Model have contributions that compete with or even dominate 
over Standard Model contributions to B4 and B, mixing. This means that CP 
asymmetries into CP eigenstates could be substantially different from the Standard 
Model prediction (Nir 1999). 


Extra Down Singlet Quarks are likely to give dramatic deviations from Standard 
Model predictions for CP asymmetries in b decays (Nir 1999). 


FCNC Couplings of the Z boson in the decay B — K*£*£^ mean that both the 
sign and magnitude of the decay leptons carry sensitive information on new physics. 
Potential effects are on the of 1096 compared to an entirely negligible Standard 
Model asymmetry of ~ 107? (Buchalla 2000). These models also predict a factor of 
20 enhancement of b > d£*£^ and could explain a low value of sin 28 (Barenboim 
20012). 


Non-commutative Geometry. If the geometry of space time is non-commutative, i.e. 
[£u £v] = 10,,, then CP violating effects may be manifest a low energy. For a scale 
«2TeV there are comparable effects to the Standard Model (Hinchliffe 2001b). 


MSSM without new flavor structure can lead to CP violation in b > sy of up to 5% 
(Bartl 2001). Ali and London propose (Ali 1999) that the Standard Model formulas 
are modified by Supersymmetry as 


Ama, = Ama(SM) [1 + f (mys, mi, mys, tan A)| (112) 
Am, = Am,(SM) [1 + f (mys, mi, mus, tan 6) | (113) 
Gr fi; Mx My 


lex! By (APAT) Cu (n fs $$) — Me) 


6/21? ^My 
+ meyefs(Ye) [1 +f (my, mi, mas, tan )| A (1 — pt, (114) 


where Am(S M) refers to the Standard Model formula and the expression for |ex | 
would be the Standard Model expression if f were set equal to zero. Ali and London 
show that it is reasonable to expect that 0.8 > f > 0.2. Since the CP violating 
angles will not change from the Standard Model, determining the value of (p, 7) 
using the magnitudes Am,/Amg and |ex| will show an inconsistency with values 
obtained using other magnitudes and angles. 


Extra Dimensions are beginning appear in papers predicting 6 decay phenomena. 
See Agashe (2001), Barenboim (2001b), Branco (2001), Chang (2001) and Papavas- 
siliou (2000). 
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I close this section with a quote from Masiero and Vives (2001): “The relevance of 
SUSY searches in rare processes is not confined to the usually quoted possibility that 
indirect searches can arrive ‘first’ in signaling the presence of SUSY. Even after the pos- 
sible direct observation of SUSY particles, the importance of FCNC and CP violation 
in testing SUSY remains of utmost relevance. They are and will be complementary to 
the Tevatron and LHC establishing low energy supersymmetry as the response to the 
electroweak breaking puzzle." 


I agree, except that I would replace "SUSY" with *New Physics." 


8.4 Expected data samples 


It is clear that precision studies of b decays can bring a wealth of information to bear on 
new physics, that probably will be crucial in sorting out anything seen at the LHC. This 
is possible because we do expect to have data samples large enough to test these ideas 
from existing and approved experiments. In Table 13 I show the expected rates in BTeV 
for one year of running (107 s) and an ete~ B-factory operating at the Y (4S) with a total 
accumulated sample of 500 fb—1, about what is expected around 2006. (LHCb numbers 
are the same order of magnitude as the BTeV numbers for many of the modes.) 


BTeV (107)s B-factory (500 fb!) 
Yield Tagged! S/B Yield Tagged! S/B 


Mode 


B, > J/ yn 22000 2200 >15 

B- > 9K- 11000 — 11000 >10 700 700 4 
B? > $K, 2000 200 52 250 75 4 
B? > K*?u*u- 4400 4400 11 ~50 ~50 ? 


D*+ z*D* D? > Kort | ~108 —10* large 8x10° 8x 10° large 


1 Tagged here means that the initial flavor of the B is determined. 


Table 13. Comparisons of BTeV and B-factory Yields on Different Time Scales. 


9 Conclusions 


I have attempted to cover the length and breadth of b physics, and have only scratched 
the surface. There is much more to be said and much more to learn. Why are there three 
families? What is the connection with neutrinos and that mixing matrix? How do we 
explain the mystery of flavor? These and many more unanswered questions I leave to the 
students. 
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The two lectures which are written up in this paper, try to answer the questions why 
hadron machines look very promising for B physics studies, what are the problems in b 
hadron detection, and how these are solved in three experiments, CDF, an example of a 
general purpose collider experiment, HERA-B, a fixed target experiment, and BTeV, a 
specially designed collider experiment in the forward direction. Some of the advantages 
of hadron machines are evident from Table 1. The bb production cross section is typi- 
cally much higher in a hadron machine, as compared to an e*e^ machine. At HERA-B, 
where the cross-section is comparable to e*e^ production, the bb production rate can 


Production ete” + T(4s) | ete” > Z° pA —bbX | pp > bbX | pp(p) > bbX 
Mechanism > BB — bb forward 
Accelerators CESR, DORIS LEP, SLD HERA (P) Tevatron Tevatron 
PEPII, KEKB 
Detectors CLEO, ARGUS ALEPH, DELPHI HERA-B CDF, D0 BTeV 
BABAR, BELLE | L3, OPAL, SLD 
o(bb) = inb = 6nb = 12nb = 50ub = 100ub 
o (bb): (had) 0.26 0.22 1075 1073 2.10? 
B°, Bt yes yes yes yes yes 
Bo, Bs, AQ no yes yes yes yes 
boost (37) 0.06 (0.5) 6 = 20 z 2-4 zs 4—20 
multiple events | no no yes, 4 yes yes, 2 
main trigger inclusive inclusive lepton pairs | leptons displaced 
vertex 


Table 1. Comparison of bb production at ete” and hadron machines 
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be improved by increasing the luminosity. Another advantage of b hadrons produced in 
hadronic collisions, compared to the B-factories at the Y (4S), is the large boost which 
results in an average decay length, (L) = 480 um. Finally, we note that the T (4S) ma- 
chines only produce B? and B+ mesons, whereas the hadron machines also produce other 
b hadrons, including Bs, B, and A,. 


When designing a detector for b hadrons at a hadron machine, we have to consider 
several issues. High interaction rates are needed because of the rather low ratio of the 
signal cross section to the total cross section. In most cases this means we are dealing 
with multiple interactions per event, so the detector has to be able to tolerate high track 
densities (see Figure 1). The detector components that are used have to be radiation 
hard, and have a sufficiently fine granularity to separate out the track hits. 


Figure 1. Comparison of bb events: CDF(top), HERA-B(bottom) 
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The low ratio of signal to total cross section also requires an early selection of in- 
teresting events. Two approaches are used in developing selective triggers with good 
background suppression. The first approach relies on the fact that the decay products 
of a B meson have rather high transverse momenta in few-body final states, such as 
B > wX, B > J/K, > punt, B — n*n-. The final states with a single lepton or 
lepton pairs are particularly attractive because lepton identification can be done rather 
easily. À second approach takes advantage of the displacement of the decay vertices of 
b hadrons from the interaction point. In the case of HERA-B this distance is typically 
lcm. Both approaches are used in the experiments presented in this paper. The detection 
of high pr tracks only takes a few hundred nanoseconds, while the reconstruction of the 
vertex takes more time, and is à part of the higher trigger levels, although the BTeV 
experiment plans to reverse this order. 


The next section reviews the excellent results from the CDF detector. These serve 
as a proof of principle for doing B physics at a hadron machine. Then follows a brief 
discussion of how to measure B, mixing. The remaining sections describe the HERA-B 
and BTeV experiments. 


2 B physics with CDF 


2.1 CDF spectrometer 


The Tevatron collider provides pp collisions at a center-of-mass energy of 2TeV in two 
interaction regions, for the two experiments, CDF and D0. The CDF detector (Abe 1988) 
is a general purpose magnetic spectrometer with a silicon microstrip vertex detector, a 
vertex TPC, a main tracking chamber, an electromagnetic and a hadronic calorimeter, 
and a system of muon chambers (Figure 2). 


1.0 rd 1]- 2.0 


IR ALARAS EA R a 
2.0 3.0 m 


LAYER 00 SVX I INTERMEDIATE SILICON LAYERS 


Figure 2. CDF spectrometer: isometric view (left) with a verter detector system, a 
central tracking chamber, electromagnetic and hadronic calorimeters, a superconducting 
solenoid, and muon chambers; side view after the recent upgrade (right) with a new sili- 
con verter detector including the additional layer (L00), silicon tracker, central tracking 
chamber (COT), and additional calorimeters. 
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Figure 3. Examples of CDF performance: distribution of di-muon invariant mass (left), 
decay length distribution for di-muon events from the J/w mass band (right). 


For b physics studies there are two interesting figures of merit illustrated in Figure 3. 
The di-muon invariant mass resolution in the J/1/ region is 16MeV /c?, and the resolution 
on the position of the Jọ vertex is 40-50 um. The performance of the vertex detector is 
essential for improving the signal to background ratio, as can be seen for the By > J/wkt 
decay channel in Figure 4. 
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Figure 4. Cleaning up the Jy K* signal: a cut on decay length (>100um) significantly 
improves the signal to background ratio (bottom). 
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2.2 CDF results 


This is a brief summary of the results from CDF. The interested reader should consult 
overview papers (Paulini 1999, Pitts 2000) for more details. 


The excellent performance of the vertex detector allowed measurements of the time 
evolution, and hence of b lifetimes and BB mixing. The results of the lifetime measure- 
ments of b hadrons are summarized in Figure 5. The precision of these results, and of 
the By mixing measurements (Gay 2000), match the precision of the LEP experiments 
(Figure 5). A search for a B, mixing signal resulted in a lower limit Am, > 5.8ps~’ at 
9596 confidence level. 


Among the contributions to the spectroscopy of b hadrons the most important were 
the first observation of the B, meson and the determination of the masses of the Bs, A; 
and B, hadrons. CDF also measured the production cross section o(pp — bX) which 
turns out to be larger than theoretical predictions by about a factor of 2 (Bedeschi 2001). 


CDF studied time-dependent CP violation in By > J/K,, obtaining the result 
sin28 = 0.79*021 (Affolder 2000). For the next generation of precision b physics ex- 
periments a measurement of the polarization in the B, + J/w@ decay is important as 
an input for the measurement of AT'/T (Affolder 2000a). CDF has also shown that rare 
decays, such as By > K* up, can be studied at hadron machines with sensitivities com- 
parable to e*e^ machines (Affolder 1999). 


CDF B Lifetimes Am, Results 
7(B°) | |1.51+0.05ps ALEPH 0.446 + 0.020 + 0.018 ps” 
(B*) le [1.66+0.05 ps DELPHI 0.497 + 0.027 + 0.023 ps” 
0: 
1(B,) 1.36 + 0.10 ps i3 6.446 s DOTÉ 
0 : 
lA) 1.32 £0.17 ps OPAL 0.466 + 0.022 + 0.016 ps" 
(B + : 
(8) =- 046 + 0.17 ps SLD 0.526 + 0.043 + 0.031 ps” 
inc. xb) 32:004 ps 
; CDF 0.495 + 0.026 + 0.025 ps” 
1(B*)1(B’) 1.09 + 0.05 


0.480 + 0.010 + 0.013 ps” 


04 0.6 
Am, [ps] 


Figure 5. CDF results: lifetimes of b hadrons (left), By mizing parameter Ama compared 
to LEP and SLC results(right). 
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2.3 CDF plans for future 


For Run II the Tevatron collider has been upgraded with a new injector and antiproton 
source. The bunch spacing has been changed from 3.5us to 396ns, and will be further 
reduced to 132ns in the second stage. It is expected that 2fb^! of integrated luminosity 
will be accumulated in the years 2001-2004, and as much as 15fb^! by 2007. Both numbers 
should be compared to the 0.1fb^! of data collected in Run I. 


Major upgrades of the CDF and DO detectors have been carried out to take advantage 
of the machine improvements. The upgrades to the CDF detector, shown in Figure 2, in- 
clude increased muon system coverage, a larger vertex detector, and a new central tracker 
consisting of a drift chamber and additional silicon layers between the vertex detector 
and the drift chamber. A time-of-flight counter has been added for r/K separation up 
to 1.6GeV /c. An additional vertexing layer(L00) close to the beam significantly improves 
the reach of the experiment for B, mixing (see next section). The trigger system now 
includes a fast track search in the first level, and a silicon vertex tracking algorithm in 
the second level. This will lower the p, threshold for muons, as well as providing a two 
track trigger for hadronic decays 


With an integrated luminosity of 2fb ^! in the first period of Run II, it is expected that 
the CP violation parameter sin 26 will be measured with an error of 0.043 (Tanaka 2001). 
A B, mixing observation will be possible up to z, = 60 at 5c significance. AT/T for B, 
mesons will be measured with an error of 0.05 using the decay channel B, — J/ó. It is 
also expected that the forward-backward asymmetry App in the rare decay By > K* up 
will be measured. The bb production cross section will be studied further, as well as 
exclusive decays of the B, meson, B, > J/V£v and J/wr. 


3 How to measure B, mixing 


To observe mixing in the B, system requires a measurement of the probability that a B, at 
time t = 0 turns into D, when it decays at a later time t. The B, has to decay to a flavour 
specific final state with a precisely measured decay vertex and B, momentum. Several 
decay channels have been considered, including the semileptonic channel B, > pD,v, 
where D, > $r*, K* Ks, K* K*, the hadronic channel B, > J/K*, where J/w > pp, 
K* — Kn, as well as more conventional hadronic decays such as B, > D(*9n* or 
D(? rtrt. A flavour tag is needed to determine the initial B, flavour at t = 0. This is 
best provided by the charge of the kaon or lepton from the associated decay of the other 
b quark. 


In what follows, a simplified picture is given of the maximal mixing parameter z, 
an experiment can measure, as a function of the resolution and available statistics. The 
interested reader should consult Moser & Roussarie 1997 for a full discussion. For a 
perfectly tagged B, at t—0, the probability that this B, meson decays as a D, depends 
on the proper time £ according to (1 — cos(z,t/r)) exp(—t/T). Imperfect tagging and 
backgrounds reduce the amplitude of the cosine term by a *dilution" factor (D): 


r(B, 2 B,) = e™ (1 — Dcos(z,t/7)). 


For lepton tagged events the value of D is about 0.19 (Figure 6). 
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Figure 6. Erample of B, mizing with £s = 30 (Am, = 22ps ! ): (a) perfect resolution 
and no dilution, (b) with a dilution factor D = 0.19, (c) with dilution factor D = 0.19 
and a finite resolution o, = 45fs 


The amplitude of the mixing signal is further reduced by the finite resolution in proper 
time t. In the limit of very good resolution (c, & z,T), a simple quantitative estimate 
of the effect can be made by observing that the amplitude is reduced by events moving 
from the “up” side of the wave to the “down” side, the effect being most important in 
an interval of order o, around the node of the oscillating term (Figure 7). In this region 
the function is approximately linear, and the fraction of events in this interval can be 
roughly estimated to be o?x?/27?. A full derivation involves a convolution of the time 
evolution curve and the detector response. The amplitude turns out to be reduced by 
a factor exp(—o?x2/2r?), in agreement with the rough estimate to first order in o7/7?. 


rate 
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Figure 7. Dilution of the oscillatory term due to finite resolution. 
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The dilution D is replaced by D' = Dexp(—o222/27?) , giving the distribution shown in 
Figure 6(c). 

A simple estimate of the statistics required for a significant measurement can be made 
by fixing z,, and dividing the events into two classes, those from the *up" part of the 
wave, and those from the "down" part (Figure 7). The measured oscillation amplitude 
for a given x, differs from zero if the two classes are found to be differently populated. 
The distribution over the two classes is binomial, with probability for the *up" part equal 
to p = 0.5 + cD’, where c is a constant of order 1. The error on p, o(p) = J/p(1 — p)/N, 
for a measurement with N reconstructed and tagged events. This expression can be 
simplified for p = 0.5 to o(p) ~ 1/(2V/N). The resulting error on the amplitude D' is 
op 7: 1 (2e / N), and the significance of the measurement is: 


2,2 


D' [op z: 2cV ND exp -45 | | 
27? 


For a given significance, the number of events needed, and thus the duration of the 
measurement, is proportional to exp(c2z2/27?). This is a very steep function of the 
proper time resolution o; and the mixing parameter z,, particularly in the region where 
the quadratic term in the exponent is greater than 1. In the upgraded CDF detector, 
where o,=45fs is expected, this value is reached at z, = 30. 


4 HERA-B 


HERA-B is a fixed target experiment at the storage ring HERA at DESY (Lohse et al. 
1994). The experiment was originally designed to measure CP violation in the decays of 
neutral B mesons to J/yK% (Figure 8). Bottom hadrons at HER A-B are produced with 
a beam of 920GeV protons from the storage ring which hits an internal wire target. The 
average momentum of the B mesons is 120GeV, so they travel about 1cm before they 
decay. The J/*? from the decay B? — J/V K9 can be triggered and reconstructed in its 


p 
920 GeV 


lepton tag 


k- 
tagging b quark 


Figure 8. A B > J/yK® event showing the typical momenta and decay distances 
expected in the HERA-B experiment. 
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dileptonic decay mode. From two existing measurements of c (bb) in the relevant kinematic 
region (Jansen et al. 1995, Alexopoulos et al. 1999), the fraction of events containing 
b quarks is of order 1075. Since the branching ratio for the decay B^ > J/K} is of 
order 1073, and J/y — II is 6%, the ratio of the signal channel to the total inelastic 
cross section is of order 107!!. The HERA-B design requires a very high interaction rate 
of 40MHz, with an average of four proton-nucleon interactions occurring for each bunch 
crossing (every 96ns). This interaction rate leads to fluxes of charged particles through 
the detector components which are comparable to the fluxes expected at the LHC. 
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Figure 9. The HERA-B spectrometer. 


4.1 The HERA-B spectrometer 


HERA-B is a large magnetic forward spectrometer (Figure 9) with outer dimension of 
20m x9m x 7m. It covers a polar angle between 10mrad and 200mrad with respect to the 
beam direction corresponding to a 9096 coverage in the center of mass system. The main 
detector components are wire targets, a vertex detector system, a tracking system, a ring 
imaging Cerenkov counter, an electromagnetic calorimeter, muon chambers and a trigger 
system. 

An important requirement for HERA-B was to not interfere with HERA operation for 


ep collider experiments. The solution was to put several target wires in the halo of the 
proton beam (Ehret 2000). Protons which leave the core of the beam, and are therefore 
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Figure 10. HERA-B wire target concept (left), reconstructed verter positions (right). 


lost for collisions, interact in 8 wires which are arranged in 2 stations (Figure 10). This 
arrangement allows for an efficient separation of primary vertices for events with multiple 
interactions. The interaction rate can be changed by moving the wire targets within the 
beam halo, and the number of wires in operation can be varied by retracting some of them. 
Operation of the wire target is very stable, and meets the criteria for small interference 
with collider operation. 


4 stereo views / SL 
12.5, 87.5, 92.5 deg 


Figure 11. The HERA-B vertex detector system. 


The vertex detector system (VDS) has to provide track co-ordinates for reconstructing 
decay vertices and the impact parameters of decay products. The VDS (Masciocchi 2001) 
consists of 64 double-sided silicon microstrip detectors assembled in eight super -layers of 
four quadrants around the proton beam axis (Figure 11). To minimize multiple scattering, 
the first seven super-layers are placed in a Roman pot configuration. The VDS has 
successfully operated for several years with interaction rates of up to 40MHz. It has a hit 
efficiency better than 97%, and a hit resolution of 10 um. 
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Figure 12. The HERA-B Outer Tracking system: honeycomb drift chambers (top), 
modular structure (bottom). 


The tracking system extends over a total length of 11m. The first group of super- 
layers are situated inside the field of a dipole magnet, while the others are in the field-free 
region. The particle flux increases towards small polar angles, so the system is divided 
into inner and outer parts consisting of detector elements with different granularity. The 
Outer Tracker (OTR) measures charged tracks over most of the geometrical acceptance. 
It consists of honeycomb drift chambers of two different granularities (Figure 12), covering 
a total area of about 1000m?. The system has to operate in a high radiation environment, 
which led to some serious aging problems (Hohlmann 2001). During the year 2000 run, 
the overall hit efficiency was also somewhat worse than expected, due to HV failures of 
individual wires. We note that the HERA-B outer tracking system is the first large area 
detector working in a LHC-like environment. 


The inner tracker (ITR) measures tracks at small distances from the beam, and provides 
input for the first level trigger (Grad! 2001). The ITR elements cover the region from 
6cm to 25cm around the beam pipe, where the particle flux can reach 10° per square 
centimeter per second. The detectors are microstrip gas chambers with a gas electron 
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Figure 13. MSGC-GEM detector structure. 


multiplier stage (GEM), as shown in Figure 13. After an extensive conditioning phase, 
the operation of the ITR was stable in the year 2000 run, with performance close to the 
design value. Performances of the complete tracking system for K b and A candidates are 
shown in Figure 14. 
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Figure 14. Performance of the HERA-B tracking system: K} and A signals 


A ring imaging Cerenkov counter (RICH) identifies charged hadrons from the photons 
emitted in a gaseous C4F o radiator (Ariño et al. 2000). The photons are reflected by two 
mirrors and collected as rings on multianode photomultipliers (Broemmelsiek 1998), using 
a lens system (Figure 15). To match the variation in occupancy and reduce the cost, 4 
and 16 channel versions of the Hamamatsu R5900 multianode PMT are used. The system 
has typical rates of IMHz per channel in the hottest parts. It has been in operation since 
summer 1998, with very few noisy or dead channels (Figure 16). 

For ultrarelativistic particles, the average number of photons in a ring is 32, and the single 
photon resolution is oC = 0.7mrad for the finer, and 1mrad for the coarser granulari- 
ties, in excellent agreement with the design values. Figure 17 shows the probability that 
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Figure 15. HERA-B RICH: radiator vessel (left), light collection system (right) 


Figure 16. HERA-B RICH: a low occupancy background event with two rings on the 
photon detector (left), a typical event (right). 
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Figure 17. HERA-B RICH: particle identification efficiency from J/y > pup. 


a muon from a J/% decay is recognized as a muon by the RICH, as a function of muon 
momentum. The performance of RICH is expected to improve with better alignment of 
the tracking system. 


The electromagnetic calorimeter (Avoni et al. 2001) measures photon energies, identi- 
fies electrons and provides an electron pretrigger. The calorimeter is of the “shashlik”-type 
with three different granularities in order to match occupancies. The measured energy 
resolution, o(E)/E = 1.7% @ 22.596/ V E, is close to the design value. 


The muon detector (Eiges et al. 2001) separate muons from hadrons and gives a muon 
pretrigger. It consists of four layers of detector elements embedded in iron absorbers. 
Three types of detector elements (tube, pad and pixel chambers) are used in order to 
match the required granularity. In the 2000 run, the muon pretrigger provided a good 
J/w signal using a fourfold coincidence of signals in the pad chambers. The performance 
was limited by inefficiencies in certain parts of the detector. 


The huge background from inelastic proton-nucleon scattering requires a highly selec- 
tive and efficient trigger. The main idea of the HERA-B trigger (Gerndt 2000, Kreuzer 
2001) is to search for a pair of high-p; leptons, with invariant mass consistent with a J/wv. 
At the pretrigger level, data from the calorimeter, muon and tracking systems are used 
to define regions of interest which correspond to particles with high transverse momenta. 
Pattern recognition and track reconstruction are performed with dedicated processors at 
the tracking system stations (Figure 18), starting from the track candidates defined by the 
pretrigger system. A cut is made on the invariant mass of candidate track pairs The first 
trigger level aims to reduce the rate by a factor 200. Second and third level triggers refine 
track candidates and search for vertices. They are designed to give suppression factors of 
100 and 10 respectively. A fourth level performs the complete event reconstruction and 
writes data to storage. The best results in the 2000 run were achieved with a first level 
trigger requirement of a single high-pt lepton; and a search for the second high-pt lepton 
in the second level trigger. We note that in this period the first level trigger efficiency 
was 2096 for single muons and 5496 for electrons, which is well below the design value of 
90%. This is mainly due to inefficiencies in the tracking and muon chambers. 
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Figure 18. HERA-B first level trigger system: pretrigger candidates are followed through 
track finding units (TFU), combined into pairs in track parameter units (TPM); before a 
final decision by the trigger decision units (TDU). 


4.2 HERA-B status and plans 


The experiment was approved in 1995, but the design and construction of detector ele- 
ments that can cope with the extremely high fluxes proved to be a technological chal- 
lenge. Problems with the tracking system caused a substantial delay in the schedule, and 
installation of the full detector was only completed in February 2000. The first phase of 
commissioning lasted 6 months (HERA-B Collaboration 2000). The performance of the 
detector for Jy events is shown in Figure 19. 


After the end of the current HERA shutdown, a continuation of the operation of 
HERA-B is expected in the second half of 2001. Due to the late start and the trigger 
inefficiencies, HERA-B is no longer competitive for a measurement of CP violation. In 
the 2001/2002 running period the experiment will concentrate on the measurement of 
events which contain a J/ meson, for which the detector is optimized. Based on the 
results of the 2000 run, and detector improvements, it is expected that 1500 reconstructed 
J/w events will be collected per hour, resulting in 2.1 million events in the full running 
period. With these data HERA-B can measure the nuclear dependence of charmonium 
production, and the bb production cross-section. 


A suppression of the yield per nucleon of J/w in proton-nucleus, pion-nucleus, and 
heavy ion collisions is observed in heavy nuclei compared to light nuclei (Leitch et al. 
2000, Alde et al. 1991, Leitch et al. 1995). The effect depends strongly on the zp and 
pr of the J/*. Various mechanisms can contribute to this suppression, but their relative 
importance is not clear. HERA-B can measure the production of J/y mesons down 
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to rp = —0.4, as well as v" and Xe, using the electromagnetic calorimeter to reconstruct 
Xe > J/wy. Running simultaneously with target wires made of different materials reduces 
the systematic errors. Last year HERA-B took data using Ti and C wires. 


A measurement of of the production cross-section oj is needed, because there are 
two inconsistent results (Jansen et al. 1995, Alexopoulos et al. 1999). For the related 
processes, A — bbX, the theoretical predictions underestimate the cross section. The 
measurement of oy; at HERA-B will be based on inclusive b + J/YX compared to direct 
J/? production. The J/y candidates with decay vertices detached from the target wire 
will be attributed to decays of b hadrons. It is expected that 340 events of this type 
will survive all cuts with a small background. The result for oj; is expected to have a 
statistical accuracy of 5-10%. 


HERA.B is the first experiment working in a LHC-like environment, As such it has 
accumulated valuable experience during construction and commissioning which is vital 
for the success of the next generation of hadron experiments. After further improvements 
during the 2001 shutdown, final commissioning of the detector will be completed. 
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5  BTeV - a second generation B factory 


The motivation for the BTeV experiment is the expectation that in the next few years 
the e*e- B-factories will firmly establish the evidence for CP violation in B decays, but 
that several questions will remain open due to lack of statistics, and the restricted set of 
b hadron decays that are available at the ete machines. The goal of B TeV is to perform 
high precision measurements of the CP parameters y, a, x, in the decays of B^, B+ and 
B, mesons, and to search for new physics in rare and flavour changing neutral current 
decays (Stone 2001). 
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Figure 20. Production of bb pairs at BTeV: dependence of boost on the pseudorapidity 
(left), correlation of production angles of the bb pair (right). 


The BTeV experiment (Santoro 1997, Skwarnicki 2001) will exploit the very large 
production cross section for bb pairs in the forward and backward direction in pp collision 
at the Tevatron (Stone 1997). The boost factor Gy becomes very large in this region, 
compared to the central region covered by CDF and D0. Figure 20 shows that b quark 
pairs are either both in the forward or both in the backward direction, so a single arm 
spectrometer similar to HERA-B would in principle be sufficient for flavour-tagged CP 
measurements. 


5.1 BTeV spectrometer 


To accomplish its goals, the BTeV spectrometer (Sheldon 2001), shown in Figure 21, has 
to meet the following requirements. It has to efficiently trigger on hadronic B decays, and 
it needs a high resolution vertex detector to measure B, oscillations. An excellent particle 
identification system is required for 7/ K separation, and a high resolution electromagnetic 
calorimeter for 7° reconstruction. All the detector components have to survive high 
interaction rates. Finally, it needs a high capacity data acquisition system that can 
handle a large number of read-out channels. 


The BTeV tracking system has three major parts, a vertex pixel detector, a system 
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Figure 21. BTeV spectrometer with an expanded view of the interaction region. 


of straw-tube chambers as the main tracking device, and silicon strip detectors in the 
vicinity of the beam pipe. The pixel detectors are essential to eliminate ambiguities and 
provide a detached vertex trigger in the high track density region (Artuso 2001). The 
detectors are expected to have a low noise. The read-out electronics is subject to the 
same radiation load as the detector, but it has been possible to develop radiation hard 
electronics using the deep sub-micron process (Christian 2001). 


The identification of hadrons is essential for the tagging of b-flavour in CP violation 
and mixing measurements. It is also crucial for the identification of hadronic final states 
such as B — ntr. The importance of the latter is illustrated in Figure 22, where the 
B — ntr” events are obscured by a large background from other, kinematically similar, 
two-body decays. The kaon and pion momentum range that the identification device has 
to cover, can be deduced from Figure 23. To achieve the necessary 7 / K separation in the 
range 3 « p « 70GeV/c, a RICH counter will be used (Figure 24), with the same radiator 
gas, C4F1o, as in HERA-B. To supplement the performance in the low momentum region 
and to improve p/K separation, plans are being discussed to add an aerogel radiator. 
A novel type of a position sensitive photon counter will be used (Figure 25, the hybrid 
photon detector (HPD) (Alemi 2000). A high electric field is used to accelerate photo- 
electrons onto a silicon pixel detector. The HPD is still under development, and is also 
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Figure 22. Invariant mass distributions as determined by MC simulation for the B > 
ata decay together with the two body background sources, B > Kr, B, > KK and 
B, — Kn, without particle identification criteria. 
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Figure 23. Momentum spectra of tagging kaons (left) and pions from B + ntr” (right). 


foreseen for use in the LHCb experiment (Wotton 2000). 


The calorimeter needs to have high resolution and good radiation tolerance. For 
this BTeV will use 22000 Lead Tungstate crystals (PbWO,), with size 27x27 x2220mm?. 
This is the same technology as for the CMS experiment. The crystals are radiation 
hard, fast scintillators with 99% of the light emitted in less than 100ns. They will be 
read out with photomultiplier tubes. The energy resolution predicted by a full GEANT 
simulation is c(E)/E = 0.55% 6 1.696/ / E(GeV), and the position resolution is given by 
Oz = 3500um/ E(GeV) & 200um. 

'The muon detector will be used both to identify muons, and to provide an auxiliary di- 
muon trigger for checks of the detached vertex trigger, particularly in the commissioning 
phase of the experiment. It consists of two toroids with three detector sets. 
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Figure 24. BTEV RICH: detector layout with aerogel and C4Fio radiators, spherical 


mirrors and HPD photon detectors (left), momentum dependence of the Cerenkov angle 
in the two radiators (right). 
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Figure 25. The HPD design for the BTeV RICH. 


One of the main components of the BTeV spectrometer is the trigger system (Artuso 
2001). The first level trigger is meant to find segments in the entrance and exit of the 
pixel detector as shown in Figure 26. It will then match segments, get crude momentum 
estimates, and reject low momentum tracks from further use for vertex determination. In 
the next step, the primary vertex will be found, and then for each of the tracks which 
miss the primary vertex, the significance of the detachment parameter, b/o(b), will be 
calculated (Figure 27). 


The goal of the first level trigger is a factor of 100 rejection of bunch crossings which 
do not produce a bb pair, and an efficiency exceeding 50% for interesting events. As an 
example, a requirement of two tracks with b/o(b) > 6 leaves 1% of beam crossings, and has 
63% efficiency for B9 + ntr, 70% for B© — D? K-, and 50% for B? — Jf K9. The 
dependence of the signal efficiency and the background rejection power on the detachment 
parameter cut for different numbers of tracks is shown in Figure 28. 
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Figure 26. Event in the pizel detector, with the horizontal coordinate compressed by 
factor 10. Left: raw event, right; segments found at the entrance and exit of the detector. 
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Figure 28. Level 1 trigger efficiency for B, > D,K (left) compared to the efficiency for 


minimum bias events (right). 
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The second trigger level will rely on re-fitting the pixel segments, including more hits, 
and linking these tracks to the downstream tracking systems. This will give an accurate 
momentum measurement, and a better determination of the detachment parameter of the 
track. The aim is to achieve a further rejection of a factor of 10 or more at 9096 efficiency. 
Finally a third level will involve more processing, with the goal of reducing the event 
output rate of 4kHz, with an event size of 40kbyte. 


5.2 BTeV physics reach 


The comparisons shown in Table 2, are meant to illustrate in which respects the B TeV 
experiment is a second generation B-factory. In the first column we list the expected 
numbers of events collected by the e*e^ B-factories with the integrated luminosities of 
500fb^! which are expected by 2005. In the second column we present the amount of 
data the BTeV experiment is expected to reconstruct in one effective year (10’s). The 
interested reader is referred to a review article (Stone 1997) for further discussions, as 
well as a comparison between the BTeV and LHCb experiments. 


Mode ete” B-factories BTeV | 
with 500fb7! yield in 10’ s 

Ba > tne 850 f 24000 

Ba OK 700 11000 

Ba óKs | 250 (75 tagged) | 2000 (200 tagged) 

Ba > K* up 50 2200 

B, 3 Jig - 15000 


Table 2. Comparison of expected BTeV yield of reconstructed events in some interesting 
decay modes with the expectations from the e* e^ B-factories by the year 2005. 


6 Summary 


We summarize this review by observing that although the hadron production of B mesons 
has a large cross section, the background levels are high, so elaborate triggering strategies 
are needed to extract the signal events. In addition an efficient operation of the detector 
in a high rate environment is required over extended run periods. The results of B physics 
studies by the CDF experiment prove that B physics can be done at hadron machines. The 
HERA-B spectrometer components are ready after extensive R& D to develop radiation 
hardness and high rate tolerance. The trigger of the HERA-B spectrometer is being 
commissioned, and a year of data taking is expected in 2002. This will provide valuable 
experience for the next generation of experiments at hadron machines. Finally, the second 
generation of B factory experiments, BTeV and LHCb, are being developed, and are 
expected to open a new era of B physics after 2005. 
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1 Introduction 


A small matter-antimatter asymmetry of the weak force has been experimentally estab- 
lished. This CP violation may be related to the small excess of matter from the big bang. 
The nature of CP violation in the K? system has been clarified after 37 years of exper- 
imentation as being due to a non-trivial phase in the weak quark mixing matrix which 
generates "direct CP violation" in the weak Hamiltonian. The experiments demonstrating 
this direct CP violation are discussed. 


2 The big bang and the expanding universe 


In our present universe, we observe with optical telescopes about 100 billion (10!!) galaxies 
each of them with 100 billion stars. These stars for most of their life burn hydrogen to 
Helium, through nuclear fusion. They emit visible light from their surface and neutrinos 
from their interior. Both of these radiations have been observed from the sun. 


The analysis of the visible wavelength spectrum reveals black absorption lines due to 
the absorption of light in the atoms of the outer layers of the sun or other stars. A large 
step in the understanding of the evolution of the universe was made when Edwin Hubble 
observed in 1929 that these absorption lines for a given element are shifted towards the red 
end of the wavelength scale for stars which are very far outside our galaxy, say 100 million 
light years away. This *redshift" is interpreted as being due to these stars receding from 
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Figure 1. Galaxies receding from us with velocities indicated by the length of the arrows 


us. Hubble observed that the size of the redshift, and the velocity of these receding stars is 
proportional to their distance from us. As an example, there is a galaxy in the Ursa Major 
which moves away with 1/7 of the light velocity, i.e. 42000km/sec. Figure 1 visualizes the 
flight velocities of such galaxies, with arrows indicating the size of the recession velocities. 


Of course, our point of reference, the earth, is by no means a special one. According to 
relativity theory, each reference system is equivalent in our finite, but unlimited universe. 
Each galaxy recedes from every other with a velocity proportional to their distance, like 
two points on a expanding rubber balloon. Turning the time arrow backwards, one con- 
cludes that these objects must have been very close to each other, about 15 billion years 
ago. At that time, an incredibly hot fireball of matter and radiation started to expand 
in the “big bang”. 


During the big bang, at high temperatures above 10°° degrees Kelvin, matter docs not 
consist only of the stable building blocks which we observe in our old and cold universe, the 
protons and neutrons of the nucleus, and the electrons and neutrinos. The fundamental 
constituents of protons and neutrons, the up (u) and down (d) quarks, play their role, 
but there are also two more massive families of fundamental particles which have been 
produced and discovered in modern accelerator laboratories over the last few decades. 
Thus we have altogether three families: 


1: up (u) and down (d) quarks, electrons (e) and their neutrinos (v) 
2: strange (s) and charm (c) quarks, muons (5) and their neutrinos (va) 


3: bottom (b) and top (t) quarks, tau leptons (7) and their neutrinos (v,) 


In addition to these matter particles, it was possible to create their antiparticles at 
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Figure 2. Production of an electron-positron pair in a hydrogen bubble chamber 


the time of the big bang . In the universe today there is no indication of antimatter left 
over from the big bang. If antimatter exists, it must show up through the y radiation 
which is produced when antimatter annihilates with matter. In the big bang, matter and 
antimatter were made at extremely high temperatures and in equal quantities because the 
forces which are responsible for their production are completely symmetric with respect to 
matter and antimatter. As an example Figure 2 shows the production of an electron and 
a positron from a photon hitting liquid hydrogen in a bubble chamber. The curvature 
of the two trajectories is due to a magnetic field which allows to distinguish between 
positive and negative electric charges. In a similar way, the other elementary constituents 
of matter were produced in pairs during the initial phases of the big bang. In this hot 
fireball, creation and annihilation of particles and antiparticles led to an equilibrium of 
approximately equal numbers of particles, antiparticles and photons. The temperature 
(measured in degrees Kelvin) corresponds to the average energy of the particles (measured 
in electron-volts, eV). The expanding fireball cools down, and below a certain temperature 
the creation of particle-antiparticle pairs stops while annihilation goes on. If all forces 
were symmetric with respect to matter and antimatter, there would be no matter left 
over in the cold phase except for photons. As a result of the expansion of the universe 
these photons would be shifted to lower energies. This cosmic microwave background was 
discovered in 1965 by Penzias and Wilson. The wave-length spectrum perfectly matches 
the Planck radiation of a black body at a temperature of 2.73 degrees Kelvin (Figure 3). 
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Figure 3. Frequency distribution of the cosmic microwave background variation, as 
measured by the COBE satellite 


This is the echo of the hot photons from the big bang. Today the density of this radiation 
is 500000 photons per litre. 


Not only is there no primordial antimatter in our universe. The ratio of nucleons over 
photons is now only about 10 !?, whereas it was of order one in the early phases of the 
universe. À possible explanation was given by Sacharov and Kuzmin. They stated that 
this small surplus of matter is only possible if 


e one force violates matter-antimatter symmetry, 
e baryon number is violated and 


e the expansion goes through phases when there is no thermodynamic equilibrium. 


3 Symmetries 


Symmetries and conservation laws have long played an important role in physics. The 
simplest examples of macroscopic relevance are the conservation of energy and momentum, 
which are due to the invariance of forces under translation in time and space, respectively. 
This was demonstrated by Emmy Noether. 


Sacharov postulated that one of the known forces or a new force violated the sym- 
metry between matter and antimatter, thus producing a small surplus of matter. All 
the remaining matter annihilated with the corresponding amount of antimatter, and at 
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the end we are left with a surplus of 107! of matter compared to the photons. Such a 
symmetry violation goes against principles which were cherished for centuries. 


In the domain of quantum phenomena, there are conservation laws corresponding to 
discrete transformations. One of these is reflection in space by the parity operator P. 
Invariance of the laws of nature under P means that the mirror image of an experiment 
yields the same result in its reflected frame of reference as the original experiment in the 
original frame of reference. This means that "left" and "right" cannot be defined in an 
absolute sense. 


Similarly, the particle-antiparticle conjugation operator C, transforms each particle 
into its antiparticle, whereby all additive quantum numbers change their sign. C in- 
variance means that experiments in a world consisting of antiparticles will give identical 
results to experiments in our world. Here again it will be not possible to define in an ab- 
solute way whether an object consists of antimatter or matter. An anti-atom composed 
of antinucleons and positrons emits the same spectral lines as the corresponding atom. 


A third transformation of this kind is the time reversal operator T, which reverses 
momenta and angular momenta. According to the CPT theorem (Lüders 1954, Pauli 
1955), there is a connection between these three transformations such that under rather 
weak assumptions in a local field theory all processes are invariant under the combined 
operation CPT. 


For a long time it was assumed that all elementary processes are also invariant under 
the application of each of the three operations C, P, and T separately. This assumption 
was questioned by (Lee & Yang 1956), and the subsequent experiments demonstrated 
the violation of P and C invariance in weak decays of nuclei and of pions and muons. 
This violation can be visualized by the longitudinal polarization of neutrinos ernerging 
from a weak vertex: they are left-handed when they are particles and right-handed when 
antiparticles. Application of P or C to a neutrino leads to an unphysical state (Figure 4). 
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Figure 4. The mirror image of a left-handed nevtrino under P, C, and CP 
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The combined operation CP, transforms a left-handed neutrino into a right-handed 
antineutrino, thus connecting two physical states. CP invariance was therefore considered 
(Landau 1957) to be a valid replacement for the separate P and C invariance which was 
no longer true in weak interactions. 


A unique testing ground for CP invariance are elementary particles called neutral K 
mesons. They have the property that the particle K? and its antiparticle K? differ in one 
quantum number, called strangeness, but can mix through a transition mediated by the 
weak interaction. A consequence of CP invariance for the neutral K mesons was predicted 
by (Gell-Mann & Pais 1955). It was then known that there was a V" or K? particle that 
had a short lifetime (10^ !? sec). It was predicted that there should be a long-lived partner, 
K$, and that these two physical particles are mixtures of the two strangeness eigenstates, 
K? (S = + 1) and K?(S = - 1) that are produced in strong interactions. Weak interactions 
do not conserve strangeness, but the physical particles should be eigenstates of CP if the 
weak interactions are CP invariant. With K? — CP K?, the eigenstates are: 


CP K, = CP((K?-K9)//2) = (K+K) = K, 


CP K = CP [(K°— K°)/V2] = (K?— K?)//2 = -K 

Because CP (ntr) = (x*z7) for two r mesons in a state with zero angular monien- 
tum, the decay into 7*7 is allowed for the K,, but forbidden for the Ky. Hence the K; 
is predicted to have a longer lifetime, because it must decay to three pions. This was 
indeed confirmed when the long-lived Ky was discovered. 


It was discovered by (Christenson, Cronin, Fitch & Turlay 1964), that the long-lived 
neutral K meson also decays to 7*7, but with a small branching ratio of ~ 2 x 107%. 
From then on the long-lived state was called K;, because it was no longer identical to the 
CP eigenstate Ky. Similarly, the short-lived state was called Ks. 


4 Phenomenology and models of CP violation 


The phenomenon of CP violation in decays of neutral K? mesons has been with us for 37 
years. The first ten years of intense experimentation after the discovery of the decay 
Kj — ntr” were devoted to the observation of other manifestations of the phenomenon, 
like the decay K; — n9? (Gaillard 1967, Cronin 1967), and the charge asymmetry in the 
decays K, — n*e*v and Kj, — v*yu*v (Bennett 1967, Dorfan 1967), and to precision 
experiments on the moduli and phases of the CP violating amplitudes (Kleinknecht 1976). 
These experimental results excluded a large number of theoretical models proposed to 
explain CP violation, such that at the time of the London conference (Kleinknecht. 1974) 
only two classes of models survived. The superweak model postulated a new, very weak, 
CP violating interaction (Wolfenstein 1964) with AS — 2, whereas milliweak models 
invoked a small (107?) part of the normal AS — 1 weak interaction as the source of CP 
violation. In the milliweak model there is also a direct decay of a Kaon state with CP 
quantum number -1 into a two-pion state with CP +1 through the weak Hamiltonian. 
This is called “direct” CP violation, as opposed to "indirect" CP violation by A?/ A? 
mixing, which is a feature of both models. The key question is can one devise experiments 
to distinguish between these models? 
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In this context it was very important that a specific milliweak model within the Stan- 
dard Model was proposed (Kobayashi & Maskawa 1973). At the time of the discovery 
of CP violation, only 3 quarks were known, and there was no possibility of explaining 
CP violation as a genuine phenomenon of weak interactions. This situation remained 
unchanged with the discovery of the fourth quark, because the 2x2 weak quark mix- 
ing matrix has only one free parameter, the Cabibbo angle, and no non-trivial complex 
phase. Kobayashi and Maskawa remarked that this picture would change if six quarks 
are present. Then the 3x3 mixing matrix naturally contains a complex phase, as well as 
three mixing angles. It is then possible to construct CP violating weak amplitudes from 
*box diagrams" of the form shown in Figure 5. 


al 


u,c,t S 


Figure 5. Bor diagram for K°—K° mizing connected to CP violating parameter e. 


A consequence of this model of CP violation is that the relative decay rates for K; > 
ntr and K; > n?n’ are not necessarily equal. This direct CP violation is due to 
"Penguin diagrams" of the form given in Figure 6. 


Figure 6. Penguin diagrams for K? — 2r decay with direct CP violation. 


For a quantitative discussion, we use the conventional notation. The eigenstates of 
CP are: 


Ki (K? + K°)/V2=+CPK, 
Ky = (K*- K°)/V2=-CPK, 


The physical long-lived (Kz) and short-lived (Ks) states are defined as: 


Ks = (Ki +¢5K2)/(1 + es)? 
Kr, = (Ks T eL Ki)/(1 T le B)! 
With CPT invariance, eg = e; = e, where: 


c= Iml 2/2 + iIm Mj. 
ills — T;)/2 — (ms — mi) 
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is given in terms of the elements of the K? — K? mass matrix M and decay matrix T. The 
phase of e, is 
arg(e) = arctan(2Am/T's) = 43.7? x 0.2°. 


The experimentally observable quantities are 


In. let =n _< ntr-|T|K, > 
un ui < rtr [T|Ks >’ 
< nm IT|K, > 


ioo — . 
Imole "oo < Pr |T| Ks > 


It can be shown that these amplitude ratios consist of a contribution from CP violation 
in the box diagrams describing K? — K? mixing, called e above, and another one from CP 
violation in the penguin diagrams associated with weak K — 27 amplitudes, called e: 


Ny —e-4é Noo = €— 2€ 


The quantities 7,_, noo and 3e' form the Wu-Yang triangle in the complex plane. The 
CP violating decay amplitude ¢’ is due to the interference of a AJ = 1/2 amplitude, Ag, 
and a AI = 3/2 amplitudes, A: 


1 iIm A; 
^. 24g 


exp[i(ó; — 69)] 
and its phase is given by the 77 phase shifts in the J = 0 and I = 2 states, 69 and ó;: 
arg(e) = (65 — ôo) + 1/2 


which is measured experimentally to be (61 + 9)? (Biswas 1981) or (45 x: 9)? (Ochs 1981). 
The two models discussed above differ significantly in their predictions. In the superweak 
model there is no direct CP violation in weak decays, e' = 0, and therefore n4- = rog = €. 
In milliweak models it is likely that € Z 0. 


5 Theoretical estimates for the parameter e' 


The parameter ¢’ can be estimated within the weak quark-mixing model of Kobayashi and 
Maskawa if one infers the magnitude of the quark mixing angles from other experiments, 
and calculates the hadronic matrix elements for box graphs and penguin graphs. Typical 
values for |e'/e] are in the range +0.2 — 3.0 x 107? for three generations of quarks. A 
measurement of this quantity to this level of precision therefore becomes crucial for our 
understanding of CP violation. Since the phase of e' is close to that of e, and since 
|e’ /e| < 1, we get: 


i 


€ e 1 
—-—RHRel—-|2-í1- Hu 2 
z ((£) sí [n00/7+ P) 
Various methods are used to calculate the value of Re(c'/c). Due to the difficulties in 
calculating hadronic matrix elements, which involve long distance effects, the task turns 
out to be very difficult. The following results have been obtained recently: 
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1. The Dortmund group uses the 1/Nc expansion and Chiral Perturbation Theory 
(XPT). They quote a range of 1.5 x 1074 < e'/e < 31.6 x 107* (Hambye 1999) from 
scanning the complete range of input parameters. 


2. 'The Munich group uses à phenomenological approach in which as many parameters 
as possible are taken from experiment. Their result (Buras 1999) is 1.5 x 107* « 
e /e < 28.8x10-* from a scanning of the input parameters and &'/e = (7.7:$9) x 107^ 
using a Monte Carlo method to determine the error. 


3. The Rome group uses lattice calculation results for the input parameters. Their 
result is e'/e = (4.7397) x 1074 (Martinelli 1999). 


4. The Trieste group uses a chiral quark model to calculate e'/e. Their result is 7 x 
107^ < e'/e < 31 x 107* (Bertolini 1998). 


It is hoped that reliable hadronic matrix elements will be obtained in the near future by 
lattice gauge theory calculations. 


A measurement of the double ratio 


R= Imo — T(K, > 27°)/T(K; > a*r) 
|In- (Kg > 229)/T(Ks > ntr) 


to a precision of better than 0.3% is required to distinguish between the milliweak and 
superweak models, and test these theoretical predictions. 


6 Experiments on direct CP violation 


6.1 The first observation of direct CP violation 


The first observation of direct CP violation was made in 1988 by a collaboration of physi- 
cists at CERN. Their experiment, called NA31, was based on the concurrent detection 
of n?n? and m*a” decays from alternating collinear beams of Ks and K;. Kaons with 
energies around 100GeV were produced by a 450GeV proton beam from the SPS accel- 
erator at CERN. The energies of the decay products were measured in a combination 
of a high-resolution Liquid-Argon electromagnetic calorimeter and an iron-scintillator 
hadronic calorimeter. In the first runs, about 100000 decays of the type A; — n?n? and 
295000 decays of Kj, — ntr were observed, and the result for the CP parameter was 
Re(c/e) = (33 + 11) x 107* (Burkhardt 1998). In further data-taking, the number of 
observed Kj, — n?n? decays was increased to 4 x 10°. 


A similar sensitivity was achieved by an experiment at Fermilab near Chicago, called 
E731. In 1992/93 the experiments NA31 at CERN and E731 at FNAL presented final 
results. The CERN result (Barr 1993) of (23.0 + 6.5) x 107* shows evidence for direct 
CP violation with a significance of more than 3e, whereas the Fermilab result (Gibbons 
1993) of (7.4 +5.9) x 107* is consistent with zero. As a consequence of this disagreement, 
two new experiments, NA48 at CERN and kTeV at Fermilab, were constructed to resolve 
the issue. 
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Figure 7. Layout of the main detector components of the NA48 experiment. 


6.2 The NA48 detector 
The new CERN experiment NA48 (Figure 7) was designed 


e to measure all four decay modes concurrently, 
e to register data at a rate 10 times higher than NA31, 


e to achieve an improved energy resolution for photons (liquid Krypton calorimeter) 
and for charged pions (magnetic spectrometer). 


In the design of the NA48 detector the cancellation of systematic uncertainties in the 
double ratio is exploited as much as possible. Important properties of the experiment are: 


1. Two almost collinear beams which lead to an almost identical illumination of the 
detector. 


2. The lifetime weighting of the events defined as K, events, to remove systematic 
errors due to detector acceptance. 


The K, target is located 126m upstream of the beginning of the decay region. As the decay 
lengths at the average kaon momentum of 110GeV /c are As = 5.9m and A, = 3 400m 
respectively, the neutral beam derived from this target gives predominantly Ki, decays in 
the decay region. A separate Kg target is located only 6m upstream of the decay region, 
so this beam is a source of mainly Ks decays. The two beams are almost collinear, with 
the Kg target situated 7.2cm above the center of the Kr, beam. The relative angle of the 
beams is 0.6mrad so that they converge at the position of the electromagnetic calorimeter. 


The beginning of the Kg decay region is defined by an anti-counter (AKS). which is 
used to veto Kaon decays occurring upstream of the decay region. The position of the 
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AKS also defines the global Kaon energy scale which is directly correlated to the distance 
scale. The decay region itself is contained in a 90m long evacuated tank. 


The identification of decays from the Ks beam is done by a tagger consisting of an 
array of scintillators situated in the proton beam which is directed onto the Kg target. 
If a proton signal is detected within a time window of +2ns with respect to a decay, the 
event is defined as Ks event. The absence of a proton tag defines a Ki, event. 


A magnetic spectrometer is used to reconstruct Ks, — 7*7 decays. The spectrom- 
eter consists of a dipole magnet with a transverse momentum kick of 265MeV/c, and four 
drift chambers which have a spatial resolution of ^ 901m. This leads to a mass resolution 
of 2.5MeV/c?. A hodoscope consisting of two planes of plastic scintillator provides the 
time of a charged event with a resolution of about 200ps. 


À liquid krypton electromagnetic calorimeter (LKR) is used to identify the four pho- 
tons from a 7°x° event. Liquid krypton has a radiation length of X9 = 4.7cm which 
allows one to build a compact calorimeter with high energy resolution AE/E = 1.35%, as 
measured with electrons coming from a Kj, + rev. It also has a very good time resolution 
of less than 300ps and very good linearity. It consists of 13212 cells (2x2cm?) pointing to 
the average Ks decay position. The transverse spatial resolution is better than 1.3mm. 


The electromagnetic calorimeter is complemented by an iron-scintillator sandwich 
calorimeter with a depth of 6.8 nuclear interaction lengths which measures the remaining 
energy of hadrons for use in the trigger for charged events. 


A muon veto detector, consisting of three planes of scintillator shielded by 80cm of 
iron, is used to identify muons and veto K, — muv events. 


6.3 Analysis of NA48 data 


To identify events coming from the Ks target a coincidence window of +2ns between 
the proton signal in the tagger and the event time is chosen (see Figure 8a,c). Due 
to inefficiencies in the tagger and in the proton reconstruction a fraction ag, of true Ks 
events are misidentified as K; events. On the other hand there is a constant background of 
protons in the tagger which have not led to a good Ks event. If those protons accidentally 
coincide with a true K, event, this event is misidentified as a Kg decay. This fraction ers 
depends only on the proton rate in the tagger and the width of the coincidence window. 


Both effects, at; and atz, can be measured in the charged mode as Ks and Kı, can 
be distinguished by the vertical position of the decay vertex (see Figure 8b). The results 
are ag, = (1.63 + 0.03)x10~* and af; = (10.649 + 0.008)%. This means that about 
11% of true Kr, events are misidentified as Kg events, however, this quantity is precisely 
measured to the 1074 level. What is important for the measurement of the double ratio 
R is the difference between the charged and the neutral decay modes Aaz;s = af — atz. 
Proton rates for both decays modes are measured in the sidebands of the tagging window 
to determine Aa;s. The result is Aazs = (4.3 + 1.8) x 107*. Several methods have been 
used to measure Aag,, leading to the conclusion that there is no measurable difference 
between these mistaggings with an uncertainty of +0.5x1074. 

Another important correction is the background subtraction. Decays Kr, —> mev and 
Ki, — muv can be misidentified as K + 7*7 decays, as the v is undetectable. However, 
since the v carries away momentum and energy, these events can be identified by their 
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Figure 8. (a) and (c): minimal difference between tagger time and event time (Atmin). 
(b) Atmin for charged Ki, and Kg events. (d) Comparison between charged and neutral 
event time. For this measurement decays, selected by the neutral trigger, with tracks are 
used (y conversion and Dalitz decays Ks — n^n9, + vyyye* e ). 


high transverse momentum p, and their reconstructed invariant mass. The remaining 
background can be measured by extrapolating the shape of the background in the mm — p? 
plane into the signal region. In this way the charged background fraction leads to an 
overall correction on R of (16.9 + 3.0) x10 *. 


A similar extrapolation can be done in the neutral decay mode. Here the background 
comes from Kj, — 32° decays, where two y are not detected. This leads to a misrecon- 
struction of the invariant 7° masses. In this case the background leads to a correction of 
R by (—5.9 + 2.0) x 10 *. 


The number of signal events after these corrections are summarised in Table 1. The 
efficiencies of the triggers used to record neutral and charged events have been determined. 
Independent triggers are used which accept a down-scaled fraction of events. In the 
neutral decay mode the efficiency is measured to be 0.99920 3: 0.00009 without measurable 
difference between Ks and K; decays. The z*«- trigger efficiency is measured to be 
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Event statistics (x 109) 
NA48 | KTeV NA48 | KTeV 
Ks > ata | 22.221 | 4.52 | KL > ata | 14.453] 2.61 
Ks 3 a7? | 5.209 | 142 | Kj > z?s? | 3.290 | 0.86 


Table 1. Event numbers after tagging correction (NA48) and background subtraction. 


(98.319 + 0.038)% for Ki, and (98.353 + 0.022%) for Ks decays. Here a small difference 
between the trigger efficiency in Kg and K; decays is found. This leads to a correction to 
the double ratio of (—4.5 + 4.7) x107*. The error on this measurement is dominated by 
the total number of events registered with the independent trigger. This error is one of 
the main contributions to the systematic error on the measurement of R. 


The distance D from the LKR to the decay vertex is reconstructed using the position 
of the four y clusters. From the kinematics of the decay one obtains 


ij 


where E; is the energy of cluster i and rj; the distance between cluster i and cluster j. 
This formula directly relates the distance scale to the energy scale. It is therefore possible 
to fix the global energy scale with the measurement of the known AKS position in the 
neutral decay mode. More checks on the energy scale and the linearity of the encrgy 
measurement can be performed using the invariant 7° mass and the known position of 
ar added thin CH; target in which a 7^ beam produces 7° — 2y. The comparison of all 
methods gives an uncertainty of +3107‘ in the global energy scale. 


Another systematic problem is the minimization of acceptance corrections, which is 
done by weighting the K, events. The difference in the lifetime between Ks and K, events 
produces a different illumination of the detector. There are more K; events decaying 
close to the detector, whose decays products are measured at smaller radii closer to tlie 
beampipe. NA48 weights the K; events according to the measured lifetime such that the 
distribution of the z-position of the decay vertices of Ks and K, events and the detector 
acceptances become equal. Using this method the influence of detector inhomogeneities 
is minimised and the analysis becomes nearly independent of acceptance calculations by 
Monte Carlo methods. The price to pay for this reduction in a systematic error is a loss 
in statistical accuracy. 


Although the acceptance is almost equal there are nevertheless small differences in the 
beam geometry and detector illumination between decays coming from the Ks and the Kj, 
target. These remaining differences are corrected using Monte Carlo methods. to calculate 
that the deviation of the double ratio R from an input value of 1 is (26.7 + 5.7) x 107*. 


Summing up the systematic uncertainties of all different sources in quadrature, gives 
a total of 12.4 x 107* in R, and 2.1 x 1074 in e’/e. 
The result of NA48 using the data samples from 1997, 1998 and 1999 is(Fanti 1999, 


Lai 2001) 
e/e 2 (15.3 + 26) x104 (1) 
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6.4 The KTeV detector 
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Figure 9. Layout of the main detector components of the KTeV experiment 


The main elements of the KTeV detector (Figure 9) are similar to those in NA48, 
since both experiments work in a similar environment. The main difference is the way in 
which the Ks mesons are produced. KTeV uses two parallel well separated kaon beams 
derived from a single target. In one beam, the K; mesons from the target pass through a 
collimator and then decay in an evacuated decay region of 30m length. In the other beam 
the K; mesons traverse a slab of matter, known as a “regenerator”. In the regenerator, 
the K? and K’ components of the K, are affected differently by interactions with matter. 
Therefore the beam emerging behind the regenerator is a slightly different superposition 
of these two components. In the forward direction, it contains a As component which 
is coherent with the outgoing Kz component. This is the regenerated K beam used in 
the experiment. The regenerator alternates between both beams every minute in order 
to keep the detector illumination identical for the Ks and the K, components. 


The decay region of the regenerator beam is defined by a lead-scintillator counter at 
the downstream end of the regenerator. The decay region of the vacuum beam starts at 
the first photon veto detector. 


The KTeV magnetic spectrometer consists of four drift chambers, with the magnet 
providing a momentum kick of p, — 412MeV/c, leading to a mass resolution given bv 
o(m,+,-) = 1.6MeV/c?. For triggering of charged events a scintillator hodoscope is used. 


The electromagnetic calorimeter consists of 3100 pure Cesium-Iodide (CsI) crystals 
with a radiation length of Xo = 1.85cm. In the inner region the crystal size is 2.5 x 2.5 cm? 
and in the outer region 5.0 x 5.0 cm?. Two beam holes of 15 x 15 cm? allow the kaon beams 
to pass to the beam dump. The energy resolution at large photon energies above 10GeV 
is 0.7596 as measured with Kez decays. 


Ten lead-scintillator “photon veto" counters are used to detect particles escaping the 
decay volume. The background is further reduced by a muon veto counter consisting of 
4 m of steel and a hodoscope. 
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Figure 10. (a) Data versus Monte Carlo comparisons of z-vertex distributions. (b) 
Linear fits to the data/MC ratio of (a). 


6.5 Analysis of KTeV data 


KTeV has similar physical backgrounds to NA48, except that two-pion decays from As 
mesons produced by incoherent regeneration in the regenerator have to be subtracted. 
Typical numbers for the background fractions are 6.9x10-* charged background from 
Ka and K,3 decays and 27x10 * neutral background from 3x? decays. The background 
levels in the regenerated beam are 107x1074 in the neutral mode, which gives rise to 
a large systematic error, and 7.2x10~4 in the charged mode. The event numbers after 
background subtraction are given in Table 1. 


The main difference in the analysis techniques of the two experiments is the treatment 
of the acceptance correction. KTeV does not use event weighting but uses Monte Carlo 
studies to correct for detection differences between the Kg and K, decays. Ke3 and 37° 
decays are used to model the detector and the agreement between data and Monte Carlo 
is good (see Figure 10). The acceptance correction to R is calculated by Monte Carlo 
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simulation to be (231 -- 13) x 1074. This can be compared to the size of the total effect of 
e'/e on R, 168 x 1074, as measured by KTeV. The main source of systematic uncertainty 
is a slight disagreement between data and Monte Carlo comparison in the m*+m~ decay 
mode in the vacuum beam. A variation of (—1.60--0.63) x 107*m-! has been found which 
is applied as a systematic error. 


The energy scale is determined using the known position of the regenerator edge. The 
comparison of the measured position of the vacuum window with the real position gives 
an uncertainty of the global energy scale which contributes an error of 4.2x10 * on R. 


The result is obtained by a fit of Re(e'/e), the regeneration amplitude and phase to 
the event numbers per energy bin. The result published in 1999 was (Alavi-Harati 1999) 


é/e = (28.0 + 3.0 (stat) + 2.8 (sys) ) x107* (2) 
but this was subsequently corrected due to an error in background subtraction: 


c/c = (23.2 + 3.0 (stat) + 3.2 (sys) + 0.7 (MO) )x10~4 (3) 
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Figure 11. Measurements of the parameter e' /e 


7 Conclusion 


The two experiments KTeV and NA48 have definitively confirmed the original observation 
of NA31 that direct CP violation exists. The results of all published results on e'/e are 
shown in Figure 11. Therefore, the CP violation observed in the K meson system is a 
part of the weak interaction, due to weak quark mixing. Exotic new interactions such 
as the superweak interaction are not needed. With more data, both experiments will 
reach a precision of O(2 x 1074) on e'/e. The theoretical calculations of ¢’/e within the 
Standard Model are still not very precise, but this does not change the main couclusion of 
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the experiments, that ¢' is different from zero and positive, and hence direct CP violation 
exists. 


Unfortunately, this milliweak CP violation in the Kobayashi-Maskawa quark mixing 
matrix is probably not large enough to explain the observed matter-antimatter asymmetry 
from the big bang. An additional, stronger CP violation is needed for this. There are spec- 
ulations that this might be due to CP violation in the lepton sector in the early universe. 
Very heavy Majorana neutrinos could play a role in the formation of an antimatter-matter 
asymmetry at this time (Buchmüller 2001). 
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1 Introduction and short history 


Two experiments, BABAR at SLAC and Belle at KEK, succeeded this year in presenting 
new evidence for CP violation, so that 37 years after its discovery in decays of neutral 
K mesons (Christenson 1964), the effect is now established for the first time in a second 
system, the decays of neutral B mesons (Aubert 2001a, Abe 2001). Both experiments use 
double storage rings with asymmetric energies to produce B mesons in ete” collisions 
on the Y(4S) resonance. These lectures describe these so-called *B-Meson Factories”, 
the experimental detectors, the special features of experiments on the Y(4S), and the 
description of CP violation in quantum mechanics and in the Standard Model. The most 
recent experimental results are discussed. 


Let me start with a short history of CP violation in B meson decays. The b quark was 
discovered in 1977 in high-energy proton-tungsten collisions at FNAL (Herb 1977), in the 
bound bb state T (15) = 15S), with a mass of 9.46GeV. Soon afterwards, the Y (1S) (Berger 
1978, Darden 1978a), and Y(25) (Bienlein 1978, Darden 1978b), were also produced in 
e*e- annihilation with the DORIS storage ring at DESY. Both these states, as well as the 
Y (3S), are not heavy enough to decay into the lightest B mesons, B? (bd) and B* (bu), and 
their antiparticles B? and B-. The next bb resonance, the T (4S) with a mass of 10.58GeV, 
was first produced in Cornell’s ete” storage ring CESR (Andrews 1980, Finocchiaro 
1980). Its decays into B^ B^ and B" B? pairs were first seen as an enhancement in lepton 
production from semileptonic B decays (Bebek 1981, Spencer 1981). 


Soon after the discovery of the T (15), it was already clear that B mesons could show 
large CP-violating effects (Ellis 1977). Plans to look for these effects experimentally led to 
proposals for B-meson factories with e*e~ luminosities much higher than those achieved 
by the storage rings CESR and DORIS. The first proposal was worked out at the Swiss 
National Laboratory PSI (Eichler 1986, Wacker 1988), but was not funded. During this 
period, the ARGUS experiment at DORIS discovered B?B? oscillations (Albrecht 1987), 
leading to an enhanced interest in B-ineson factories, and in the search for CP violation 
in B-meson decays. The subsequent proposals at SLAC (Hitlin 1989) and KEK (KEK 
1990) were approved in 1993, and construction of the two machines PEP-II and KEK-B 
started more or less immediately. 
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The designs of the detectors BABAR and Belle were finalized and approved in 1995, 
and the first e*e^ collisions in PEP-II took place in July 1998, with the first events 
recorded by BABAR in May 1999. In July 2000, the first BABAR and Belle results were 
reported at the International Conference for High Energy Physics at Osaka. In October 
2000, PEP-II reaches its design luminosity of 3x10?! /cm?/s, and in July 2001 BABAR. 
and Belle discover CP violation in B-meson decays. 


2 PEP-II and KEK-B 


Figure 1 shows a schematic view of the B-meson factory PEP-II at SLAC. It consists 
of a double storage ring for electrons of 9GeV and positrons of 3.1GeV located in the 
2.2km circumference tunnel of the dismantled PEP storage ring. The new electron ring 
uses bending magnets from PEP, but all the other components of PEP-II have been 
built in 1994-1998. The electrons are extracted from the SLAC linac at their nominal 
energy and then transferred into the HER (high energy ring) of PEP-1I. The positrons are 
produced by high-energy electrons near the end of the linac, returned to the linac start 
point, damped by synchrotron radiation in two small rings, extracted from the linac at 
their nominal energy, and then transferred into the LER (low energy ring). The two rings 
intersect at only one point where the BABAR detector is located. 
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North Damping Ring 


Positron Source 


Positron Return Line 
e-gun 


200 MeV 
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Figure 1. Schematic view of the B-meson factory PEP-II at SLAC. 


By August 2001, a peak luminosity of £ = 3.3x109 /cm? /s has been reached, (exceed- 
ing the design value of 3x 10°); this corresponds to 3.6 ete” — T(4S) — BB events per 
second, is obtained with an electron current of 0.88 A, and a positron current of 1.55A. 
Averaged over one day the best luminosity has been 220/pb/day, compared to a design 
value of 135/pb/day. Figure 2 shows the luminosity accumulated up to the end of August 
2001. PEP-II has delivered 45 events/fb, and BABAR has recorded 43 events/fb. Out of 
these, 4/fb were recorded at a center-of-mass energy 40MeV below the Y(4S) resonance 
for background studies, and 39/fb are on the resonance corresponding to 42x10° BB 
events. The asymmetric energies of the two beams cause the BB pairs to be produced 
with a significant velocity in the direction of the high energy beam, unlike in the sym- 
metric storage rings CESR and DORIS. The Lorentz boost of the Centre of Mass system 
is: 

Ip(e*) + ple- )| 
MY (4S) 


ay = = 0.56 . (1) 
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Figure 2. Integrated luminosity of PEP-II up to August 2001. 
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Figure 3 shows the B-meson factory, at the Japanese National Laboratory KEK. It 
is a double storage ring for electrons of 8GeV and positrons of 3.5GeV leading to a 
boost of By = 0.45, which is slightly smaller than in PEP-II. The second difference is 
a beam crossing angle of 22mrad in the interaction region, whereas PEP-II collides e* 
and e^ beams with zero crossing angle. The non-zero angle could make it possible to 
reach higher luminosities. Up to July 2001, KEKB has reached a peak luminosity of 
L = 4.5x109 /cm?/s, but the integrated luminosity of 32.8/fb is slightly lower than PEP- 


II. There is only one interaction region in KEK-B, which houses the Belle detector. 
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Figure 3. Schematic view of the B-meson factory KEK-B at KEK. 
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3 BABAR and Belle 


Figure 4 shows the layout of the BABAR detector (Aubert 2001f). The BABAR Collabo- 
ration consists of about 520 physicists from 9 countries: Canada, China, France, Germany, 
Italy, Norway, Russia, UK, and USA. The detector geometry reflects the boosted event 
topology, with the e*e^ interaction region displaced from the centre of the detector. 
There are six main detector components, a silicon vertex detector (SVX), a drift chamber 
(DCH), a Cerenkov detector (DIRC), an electromagnetic calorimeter (EMC), a magnet 
coil, and an instrumented flux return yoke (IFR). 


cc 


Figure 4. Scheme of the BABAR detector. Abbreviations are explained in the text. 


The SVX reconstructs the decay vertices of the two B mesons. With the boost of 
By = 0.56, the average distance between the production and decay of a B meson is: 
Byer = 260um. (2) 
The production point ete~ — Y(4S) — BB, cannot be reconstructed, and lies somewhere 
in the bunch-crossing region which has a length c; ~ icm. The decay point of a fully 
reconstructed B meson is found with a resolution o; z 70jm, which depends slightly on 
reconstructed final state. Partially reconstructed B mesons, which are used for flavour 
tagging, have a vertex resolution o, œ% 180um. As explained in Section 4, all time- 
dependent B-meson decay information can be obtained from the distance between the 
two decay points, without any knowledge of the BB pair production point. This distance 
has a typical resolution of V702 + 180? = 1904m, dominated by the tagging vertex. This 
is below the typical decay length of 260um in Equation 2, and the B9B? oscillation length: 


27 Bye 
Ama 


= 2.0mm. (3) 
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Figure 5. The Silicon Vertex Detector of BABAR. 


The vertex resolution is achieved by tracking charged decay products with five layers of 
double-sided-readout silicon strip detectors in the geometry shown in Figure 5 


The drift chamber (DCH) tracks charged decay products in three dimensions through 
the magnetic field of 1.5T produced by the solenoid coil. It measures the track momenta, 
Pe, Py, Pz, and their specific ionisation dE/dx. The DIRC determines their velocity by 
measuring the emission angle of Cerenkov light in 2.5cm thick quartz bars. Combining 
momentum and velocity information identifies the mass of the charged particle. The EMC 
measures the directions and energies of photons, identifies electrons, and also determines 
the direction of K? mesons that interact in the calorimeter. The IFR identifies muons, 
and measures the direction of K? mesons that do not interact in the EMC. 


/KLM TOF 


Figure 6. Scheme of the Belle detector. Abbreviations are explained 1n the tezt. 


The Belle detector (Abashian 2001) is shown in Figure 6. The Belle Collaboration 
has about 300 physicists from 14 countries: Australia, Austria, China, Germany, India, 
Korea, Japan, Philippines, Poland, Russia, Slowenia, Switzerland, Taiwan, and USA. The 
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Figure 7. Scheme of Belle’s Silicon Vertex Detector. 


detector consists of a silicon vertex detector (SVD), a drift chamber (CDC), a particle 
identification system (PID), an electromagnetic calorimeter (CsI), and an instrumented 
flux return for Kr and muon identification (KLM). It is quite similar to BABAR, with 
the main difference being in the particle identification. Instead of a DIRC, where the 
Cerenkov angle is measured, Belle uses aerogel Cerenkov counters (EFC), which only 
determine velocity thresholds, and time-of-flight scintillators (TOF), which determine 
slow velocities. Belle's silicon vertex detector has only three layers, with the geometry 
shown in Figure 7. An upgrade to five layers is in progress. 
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Figure 8. A fully reconstructed event B°B® — (v(2S)K9)(D**n-) in BABAR. 


Figure 8 shows a B" B" event with both B-meson decays fully reconstructed in BABAR. 
One B meson decays into D**z- with D** > D?n*, D? + K-*. This final state is a 
flavour eigenstate, i.e. the neutral B was a B? at its time of decay. The other B decays 
into the final state (25) K9 with (2S) > pty, K} — ntr”, which is a CP eigenstate 
that can come from either flavour of neutral B. Such events are very rare, since a single 
B is fully reconstructed, with all its decay products detected and identified in BABAR, 
with a probability of only 107?. 
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4 Special features of BB pair decays 


The Y (4S) is a bb meson with mass m = (10580.0+ 3.5)MeV and width F = (14+5)MeV 
(PDG 2000). The cross section for its formation in e*e^ annihilation is 1.1nb, and it 
decays by a strong interaction into BB pairs, 50% B* B- and 50% B9B? . These final 
states consists of two and only two particles which decay by weak interactions on a time 
scale of 10- 7s. 


The experimental study of these weak decays is influenced by the fact that B mesons 
from the T (4S) come in pairs. The mass of a B meson is 5279MeV (PDG2000), so in the 
center-of-mass system of the T(4S), each B has a kinetic energy of only 10580/2 — 5279 = 
11MeV, corresponding to a velocity 8* = 0.064, and a momentum p* = 340MeV/c. Let 
us look first at the determination of the total decay rate of neutral and charged B mesons, 
through measurements of their lifetimes. 


hy 
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Figure 9. Decay topology of BB pairs and definition of Az. In the example, a B" decays 
into Dnt and the other neutral B into three non-identified tracks. 


Figure 9 sketches the decay topology of a BB pair. The two decay points are measured 
with the help of the silicon vertex detectors, but the production point is not measurable. 
Due to the small value of 8* in the T(45) frame, compared to the boost factor By = 0.56 
in BABAR (0.45 in Belle), we can approximate 


Az = 2 — 2 = Byc(t; — te) = BycAt. (4) 


Therefore, a good measurement of Az provides a measurement of At. The lifetime, r(3*), 
gives the decay distribution of a Bt B^ pair: 


N 
ËN = = e ^ir e 2/7 dt, dí» 
T 
= x e Att e7?5/7 dAt di, , 
T 
dN N e Ar pe e?8Í7 dto/r for tj > te, 
dAt ni e A7. foe 22/7 dty/t for te >t, 
N N 
dN = nm e A/T dAt = 3 e-Ih-t3/7 q(t — ta). (5) 
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Figure 10. Plots of mgs and AE* for B* — J/i K* decays in BABAR. 


Thus there are decay pairs with positive and negative values of At = tı — te, and the 
decay law is symmetric, dN(At) — dN(—At), and depends only on the absolute decay 
time difference, |At|. The first BABAR measurement of 7 uses pairs where one B is fully 
reconstructed with its decay vertex at z1, and the second vertex at zg is obtained from 
the intersection of at least two reconstructed tracks from the other B. 


Bt mesons are fully reconstructed in the decay modes D°xt+, D*9z*, J/ K*, and 
v(2S)K*. B- mesons are reconstructed in the corresponding charge-conjugated modes. 
J/w mesons are reconstructed in their ete” and u* u^ decay modes. Figure 10 shows how 
combinations of J/y candidates and Kt tracks are identified as Bt + J/v K+ decays, 
using the kinematic quantities, “beam-energy-substituted mass": 


mes = V (mras)/2)? — (p5,, + PX? . (6) 


which has to be equal to m(B*) within the experimental resolution of +2.5MeV, and the 
centre-of-mass "energy difference": 


AE* = Ej, + Ex — myas)/2 , (7) 


which must be zero within the experimental resolution of +15MeV. Figure 11 shows the 
mes distribution of all the 227300 B* candidates. Within the signal region, there are 
9396 signal events and 796 background. The At distribution of all events in the signal 
region is shown in Figure 12. The lifetime 7(B*) is obtained by fitting the expectation in 
Equation 5 to the data, folding in the experimental resolution for At. This is described 
in detail in (Stark 2001). The result (Aubert 2001b) is given in Table 1. The lifetime of 
the B? is obtained in a similar way, but in this case the presence of B° B? oscillations has 
to be taken into account. Let us have a quick look at oscillation phenomenology before 
returning to the r(B9) measurement. 


A B? meson produced at t = 0 decays at time t either as a B? or as a B°. Owing 
to second order weak interactions, illustrated in Figure 13, a neutral B meson is a linear 
superposition 

|B) = vi|B?) + vs|B?) (8) 
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Figure 11. megs distribution of all Figure 12. At distribution of all fully recon- 
fully reconstructed B^ decay candidates structed B* decay candidates in the BABAR 
in the BABAR lifetime measurement. lifetime measurement. 

The dotted curve is the background. 


PDG 2000 BABAR New Average 
T(B*)in ps 1.653 + 0.028 1.673 + 0.032 + 0.023 1.662 + 0.023 
T(B?) in ps 1.548 + 0.032 1.546 + 0.032 + 0.022 1.542 + 0.021 
r(B*)/r(B^) 1.062+0.029 1.082+0.029+0.012 1.080 + 0.021 


Table 1. B Meson Lifetime Results (Stark 2001). 
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Figure 13. Diagram describing B*B? oscillations. 


with a time dependence of p, and v; determined by initial conditions and the coupled 


differential equations, 
mi, mi i [Tu Ti 
B : (9) 
mM; M22 Da P» 


i OV 
ot 
The matrices m,; and l';; are Hermitean, with mj; = m% and T; = rz. CPT symmetry 
requires that mıı = Mma = m, Ty, = P? =T. There is also an unobservable phase, so the 
matrix j1;; has only five physical parameters, m, I, |mi?|, Re (Pi2/miz), and Im (F12/m12). 
There are two and only two solutions of equation (9) with an exponential decay law: 
Bh(t) = (pB° + qB)e ^"! yu = img + Ty /2, 
Bit) = (pBD'-aqB')e " — s = im, +P /2. (10) 


= pij Wj where py = 
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The normalisation requirement is |p|? + |q|?=1. Again the phase of q/p is not an 
observable, and the five observable quantities in equation (10) are my, mz, Tg, Ci, and 
\q/p|. Explicit formulae relating these five observables to the five parameters of jj; can 
be found in the literature (Branco 1999). By convention (H=heavy, L=light), we have 


Amg = mg — m, > 0. (11) 


The non-exponential decay laws of the two initial states: 


Vg(0) = B^, vg(0) = BY, (12) 
are given by: ert 4 eit TEST 
Q- © TE po, Ge € "gn 
walt) 2 + p 2 : (13) 
~ Ht mU >—YHE _ p-yt 
vg(t) = e” te" gu? ee po (14) 


2 q 2 


These decay laws include the probabilities P that a B? changes flavour, becoming a B®, 
and vice versa: 


P(B? + B’) = [etm ec Pet — ge (Ur +W cos( Ama + t)| ; 


2 
2 


P(B? => B°) = 


4 
p 
E [en e Tu — ge Tn TOU? cos(Amg . 2] . (15) 


ale Ble 


In the Standard Model, CP and T violation in B°B® oscillations is expected to be very 
small, and these probabilities are almost equal: 


P(B? 5 B?) = P(B? > B®) + 0(1073) © |g/p| = 1+ O(10?). (16) 
The lifetime difference should also be less than a percent: 
Ty =r, sr. (17) 
This leads to the good approximation: 
P(B? 5 B®) = P(B* 5 B°) = e ™[1 — cos(Am,- t)]/2, (18) 
and a corresponding derivation gives: 
P(B? 5 B?) = P(B? 5 B®) = e™[1 + cos(Ama - t)]/2. (19) 
B° B? pairs from T (4S) decays have to be parity eigenstates because of parity con- 
servation in strong interactions. A two-particle state B°(@)B°(m — 0), as sketched in 
Figure 14, is not a parity eigenstate owing to 
P B*(0)B*(x — 8) = B?(x — 0) B? (0). (20) 
Since the parity of the Y (4S) is —1, the allowed state is 


y = B*(8) B'(x — 0) — B(n — 6) B6). (21) 


[uc i 
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Figure 14. B?(0)B?(x — 0) is not a parity eigenstate. 


It remains in this coherent state until the first of the two B mesons decays. The state of 
the second B meson is defined at that time t,. If the first decay is a flavour-specific B®, 
we have: 


B,(ti) = B®, Bj(t;) = B^, B3(t;) = a(t, — 8) BP + Blt, — t) Bo, (22) 


with a(t) and 8(t) from equation (13), where t = t; —t,. Equation 22 describes B? B? pair 
decays also for the case when only the second B decay at time tz is flavour-specific. a(t) 
and (t) then describe the time-changing flavour of the first B which decays at t < 0. If 
both decays are flavour-specific, we have measurements of B9 B^, B? B^, or B? B? pairs as 
a function of t. 

In a recent BABAR measurement of Amg (Aubert 2001c), neutral B mesons arc fully 
reconstructed in flavour-specific decay modes, in the same manner as B+ and B^ in the 
lifetime measurement discussed above. The second neutral B is partially reconstructed 
to obtain its flavour. This flavour measurement, which will be described in more detail 
in the CP violation Section 5.4, has a certain fraction of wrong flavour determinations: 


Nwrong 
w = ——>—_ = 0.20. 23 
Nright + Nwrong ( ) 


From measurements of the decay vertices, z;, 2», and equation (4), we obtain At for 
each reconstrücted pair. Including the wrong flavour assignments, and the oscillation 
formalism with |g/p| = 1, and, = ly =T, we have the number of unmixed and mixed 
pairs, and their time-dependent asymmetry a(At): 


N(B*B) = ne ^t [a — w)[1 + cos(AmaAt)] + w[i — cos(AmaAt)]] ; 
= net D + (1 — 2w) cos(Am4At)] , (24) 
N(B°B°) = N(B'B))- ne T [1 — (1 — 2w) cosAmaAt)], (25) 


N(B°B°) — N(B°B°) 
N(B°B°) + N(B°B?) 


= (1 — 2w) cos(Am,At). (26) 


a(At) 


The time dependences of Equations 24 and 25 are shown in Figure 15a with perfect flavour- 
tagging, w = 0. In Figure 15b the effects of w > 0 and the experimental resolution in At 
are included. The BABAR results are shown in Figure 16. A fit gives: 


Ama = (0.516 + 0.016 + 0.010) /ps. (27) 
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Figure 15. Time dependence of unmixed and mized pairs of neutral B mesons from the 
Y(4S), (a) without and (b) with erperimental resolutions, solid curves = unmixed, dotted 
curves = mized. 
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Figure 16. BABAR result on B9B? mizing with fully reconstructed B mesons. The 
dotted curves indicate the backgrounds. 


BABAR and Belle have also measured Amg using other methods. A clean sample 
of “dilepton” events can be obtained, Y(4S) — £X, where £ is a positive or negative 
high momentum electron or muon. Positive leptons tag the decay of a D^ or a B®, and 
negative leptons a B^ or a B®, with a low mistag fraction w if the momentum cut is high 
enough. In 23 million BB events, BABAR finds 99000 dilepton events (Aubert 2001d), 
with a resolution on the vertex separation, c(Az) % 250um. The asymmetry 


N(£*£7) — N(£4) 


a( A0 = NECI N(EEES' 


(28) 


is shown in Figure 17. Its expectation is different from that in Equation 26 and Figure 16 
because of the presence of ¿+6 pairs from BtB- events. Taking this into account, the 
fit gives the preliminary result 


Ama = (0.499 + 0.010 + 0.012)/ps. (29) 


A compilation of Ama measurements is given in Figure 18. The "world average” of 
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Figure 17. BABAR result on B°B° mizing with lepton pairs. 


ALEPH 0.44640.02040.018 ps" 
{3 + PF prel) 
DELPHI” 0.504+0.01940.012 ps" 
(4+ I prel) 
L3 0.444+0.028+0.028 ps" 
i3) 
OPAI 0.479+0.01840,015 ps” 
(5) 
SLD 0.507+0.02340.019 ps 
(5 prel) 
0.495+0.026+0.025 ps 
BABAR ^ 0.503+0.009+0.011 ps 
(2 prel) 


BELLI 0.463+0.00820.016 ps’ 


ahi 


average of above 0.488+0,009 ps! 


after adjustments 
ARGUS+CLEO 
(Xa measurements) 


0,489+0.032 ps! 


world average 0.488:0.008 ps’ 


Figure 18. Compilation of Ama results. 


these measurements is 
Ama = (0.488 + 0.009) /ps. (30) 


When ARGUS discovered B"B? oscillations (Albrecht 1987), they used a time-integrated 
measurement of dilepton events, which gave: 
(Ama)? 


KE 
21? t 2(Amp)? 


= 0.17 + 0.05, Ama = (0.47 + 0.10) /ps, (31) 


where the value of Amga has been adjusted for the present value of T = 1/7(B°). The 
Standard Model expectation is: 


r 


wl? | Val? (32) 


Am, = GE mE") (8^) B(B')m(Wwy S (5) 
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which relates Ama to the CKM matrix element |Via| with the Fermi constant Gy, a 
parameter ng from perturbative QCD, and a parameter fv B from non-perturbative 
QCD. The Inami-Lim function S and ng = 0.55 40.01 are known with small errors. The 
dominant theoretical uncertainty in the relationship between Ama and |Vig} comes from 
low-energy QCD. For this, we will use the lattice QCD result: 


f(B°)\/ B(B°) = (230 + 25 + 20) MeV. (33) 


Let me conclude this section by returning to the B? lifetime measurement. Adding 
Equations 24 and 25 leads to the result 


N-T 
2 


1 
-T|A¢| Lo 1. 
e dAt, P= ABO (34) 


dN(At) = N(B” B°) + N(B’ B°) = 


which is as simple as Equation (5) for the determination of 7(B*) from B* B^ pairs. In 
the same analysis (Stark 2001, Aubert 2001b), BABAR fully reconstructs 7000 neutral B 
mesons in the modes D)* 57, D+F, D™+aF¥, and J/j K*9, where D+ is either a 
D or a D*. The sample purity of 90% is similar to charged B reconstruction. The other 
neutral B is partially reconstructed to measure the decay vertex, but without requiring 
a flavour determination. The fit to the At distribution gives the results in Table 1 (on 
page 353). The lifetime ratio 7 (B*)/7(B?) is significantly different from unity. This starts 
to be well understood, see (Buchalla 2001) in this School. 


5 CP Violation in B-Meson decays 


5.1 CP violation in the Standard Model 


Understanding CP violation is the main goal of the two experiments BABAR and Belle: 


» Is there CP violation in B decays and how large is it? 
e Can this violation be explained by the Standard Model weak interaction? 


» Does the same explanation hold for the known CP violation in K meson decays? 


'The answers to the second and third questions also require precision measurements of B 
meson decays without CP violation. Note that the discovery of CP violation in A meson 
decays does not mean that CP is violated in the Standard Model, since its origin could 
be in a non-Standard new interaction. 


In the Standard Model, CP violation is a simple property of the Higgs couplings to 
quarks if there are more than two families of quarks. Since the three families of leptons 
and quarks are well established and the Higgs mechanism has been treated in the lecture 
of (Rosner 2001) at this School, I will not go into details here. The essence is that the 
Higgs mechanism mixes quarks in a different way from antiquarks. The Glashow doublet. 
partners of the quark mass eigenstates u, c, and £, are linear combinations a’, s', and 
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b of the mass eigenstates d, s, and b. These combinations are different for quarks and 
antiquarks: 


d = Vad d + Vas © s + Væ b, 
s = Via d+Veg- 8+ Va:b, 
b = Via d+Vig s +Veæ'b, (35) 
d = Vig dt Vis SV, 
$ = VideVi GeV, 
b. 


V = VideVicaeV. (36) 


Here V; is a unitary quark mixing matrix , named CKM after (Cabibbo 1963) and 
(Kobayashi Maskawa 1973). CP violation arises only if Vi # Vj. Since the gauge 
invariance of the Higgs coupling term in the Standard Model Lagrangian does not ex- 
plicitly forbid complex coupling matrices, it is expected to lead to a complex matrix V;;. 
This complexity is a necessary but not sufficient condition for CP violation. Because of 
unobservable phases, we must have: 


Im (Vie Vr VaV) = J #0. (37) 


The quantity J is the same for all i # j and k Æ l (Jarlskog 1985). 


The CKM matrix has four observable parameters. There are various parametrisations, 
with experimentalists prefering that of (Wolfenstein 1984) , 


1-2/2 A AX (p — in) 
V= —A 1— A32 AX? +O(M), — Jg A®X8n. (38) 
AX(1—p-—igj -AX 1 


The four parameters of the matrix, A, A, p, and 7, are 4 of the 18 free parameters of the 
Standard Model. Six unitarity conditions VV! = 1 can be drawn as triangles (Bjorken 
1987) in the complex plane. All six have the same area J/2. The most popular one is 
shown in Figure 19: 


Vaa V + Voa Và + ViaVas 
Võ/ (AA?) — 1 + Via/ (44°) 


0, (39) 
0. (40) 


II 


Measurements of K and B semileptonic decays, which are not discussed here, give 
A = 0.220 + 1%, A = 0.83 + 5%. (41) 


Information on p and 7 comes from several observations, as summarized in Figure 20. 
The figure shows p = pv 1 — à? and 5j = n1 — X, which are different from p and 1j by 
about 2.5%. As can be seen from Equation 39, Vj, = AA3(1 — p — itj), together with 
^b = AM(p — in), fulfill unitarity more accurately than Equation 40. Consequently the 
use of p and 7j will become important when the experiments have reached a precision of 
396 or better. 
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Figure 19. The Unitarity Figure 20. Circular bands for p and n from measure- 

Triangle. ments of |Vus| and |Via|. The inner circles give the re- 
gions of maximum likelihood, the outer ones give one 
and two standard deviations. 


The measurements of [(b — uév) (LEP 1999), and (B — plv) (CLEO 1999), are 
sensitive to |V,,]. The measurement of Ama, discussed in the previous section, and 
the upper limit on B, mixing, Am, (LEP 2000), are both sensitive to |Via|. All these 
results are consistent, and we conclude that, with a significance of >1.70, r Æ 0, and 
|J| = 3x107? 4 0. These CP conserving measurements already show that CP symmetry 
is violated in the Standard Model. The significance will increase when the theoretical 
uncertainties in the circular bands of Figure 20 are reduced. 


The B-Meson Factories have so far contributed to the measurement of Amg, but they 
do not intend to measure Am, which would require running the storage rings on the 
Y(5S) resonance. They will contribute new results on |V| starting next year. 


Two immediate questions arise from Figure 20: 


e Does |J| = 3x107 explain the known value of ex for CP violation in K decays? 


e What are the predictions for CP violation in B meson decays? 


The first answer is yes, as shown in Figure 21. CP violation in K decays has been discussed 
here by (Kleinknecht 2001). The band in Figure 21 follows from |n+-| z |noo| = 2.3x107?. 
The experimental width of the band is less than 196, so the width is dominated by the 
theoretical uncertainty on Bg. Adding in the ey constraint fixes the signs of 7 and J to 
be positive. Figure 22 shows the result of a maximum-likelihood fit to the experimental 
results for |Vus|, |Vig|, and e(K) including theoretical uncertainties in the relationship of 
these measurements to p and 7. 


As discussed here by (Nir 2001), there are three classes of CP violation which can be 
observed in the B-meson system; in B?B? oscillations, in decays (“direct CP violation"), 
and in the interference between oscillations and decay. In the Standard Model with p 
and 7 as fitted in Figure 22, the largest effects are expected in the third class, where the 
decay mode is a CP eigenstate. There are some modes where the prediction for the CP 
asymmetry is very insensitive to theoretical uncertainties, especially 


dN(B° + J/óK9) — aN(B* > J/oK2) 
dN(B° > J/pK) + dN(B° > JJYK$) 


ap — = [sin 28 + O(1%)|sin(AmgAt), (42) 
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Figure 21. The hyperbolic band shows the relation of p and Figure 22. Fit result 
3j to the measured value of €x. for p and m. 


where has been defined in Figure 19: 


2n(1 — p) (43) 


sin 28 = =, 
28 = + (= py 


and At is the difference between the decay time into J/YK, and the decay time of other 
B in the B?B? pair. Before coming to the BABAR and Belle results for aș, I would like 
to discuss briefly the other two classes. 


5.2 CP violation in B?B? oscillations 


The two mass and lifetime eigenstates in the neutral B system have been defined in 
equation (10). CP conservation in B? B oscillations is equivalent to |g/p| = 1. If CP is 
violated, we may use the Kaon language for p and q, 


2Re (ep) 
1+ ļegl? 


z PEE 
p-4 p_1t+ep f (44) 


EB M 774 ME ee , 
ptq q l-es q 


Because there are several e parameters in Kaon physics, we choose the notation eg 
for c in mixing. The imaginary part of eg is unobservable. Since the Standard Model 
expects Re (eg) to be small, as ex is in K?K? oscillations, a useful convention is to take 
Im (cg) = 0, giving |p/q| = 1+ 2Re (eg). 

CP violation in oscillations shows up as a different number of B*B? and B"B? decay 
pairs. A straightforward calculation gives 


dN(B°B°)(At) - dN(B°B)(At) _ 4- Re(eg) 


ag (At) = dN(B° B°)(At) + dN(B°B?®)(At) EET les? ` 


(45) 


This asymmetry is independent of At. Table 2 gives results for Re (eg), including a recent. 
BABAR result (Brandt 2001) which uses 21 million Y(45) events. No CP violation is 
seen. This is not surprising since the Standard Model expectation is below the sensitivities 
reached by current experiments. 
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Experiment Reference Method Sample €g/ (1 + lep|?) 
OPAL (Ackerstaff 1998)  £* £* /£- £7 3M Z? 0.002 + 0.007 + 0.003 
OPAL (Abbiendi 1999) inclusive 4M Z? 0.001 + 0.014 + 0.003 
ALEPH (Barate 2001) combined 4M Z? —0.003 + 0.007 
CLEO (Jaffe 2001) LOJO 10M Y(4S) 0.004 + 0.010 + 0.002 
BABAR (Brandt2001) £*£*/t-£- 21M Y(4S) 0.0012 + 0.0029 + 0.0036 


Table 2. Searches for CP violation in B°B® oscillations. 


5.3 CP violation in decays 


BABAR, Belle, and CLEO have searched for direct CP asymmetries in various decay 
modes. As explained in (Nir 2001), the decay amplitude A(B — f) with B = B+ or D? 
has to be described by at least two diagrams, with different weak phases and with different 
strong phases, in order to show a direct CP asymmetry: 


r(B > f) - (B 5 f) 


ace) = FBSA) XT(B o f) 


(46) 


Table 3 shows the limits on asymmetry values for five decay modes. For details and 
references, see (Hamel de Monchenault 2001). Again, no indication for CP violation is 
seen, but this is not in contradiction with Standard Model expectations. 


Mode BABAR Belle CLEO 

Ktn —0.19 + 0.10 +0.04 +0.19 — 0.17 —0.04 + 0.16 
K+’ 0.00 + 0.18 —0.06 4-0.22 — 0.20 —0.29 + 0.23 
Kant —0.21+ 0.18 +0.10 +0.43 — 0.34 —0.18 + 0.24 
J/bKt  —0.009 + 0.027 + 0.005 +0.018 + 0.043 + 0.004 
K*y —0.035 + 0.076 + 0.012 +0.08 + 0.13 + 0.03 


Table 3. Search for direct CP violation in hadronic and radiative B decay modes. The 
K* result is an average of K*+ and K*°. 


5.4 CP violation in interference 
This third class of CP violation requires decays of neutral B mesons into CP eigenstates, 
lf) nif) with np =+lor -1. (47) 


Examples are f = ntr, where y; = +1, since the two pions are in a state with £ = 0, 
P = +1, and f = J/wK&, where n; = —1. The notation J/wK¢ is a short-hand: in reality 
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K$ and J/YK?} are not CP eigenstates, and what are observed are the decay cascades: 

B+ JhpK? with K? > rtr, 

B? JhpK? with  K9*—m*m, (48) 
where the reconstructed state J/v(m*m ) with mz, = mx is a strict CP eigenstate. The 
CP eigenvalue is —1 because CP(J/v) = +1, CP(am) = +1, and £ = 1 between the 
J/'b and the v7 state is required by angular momentum conservation with J(B) = 0, 
J(J/v) = 1, and J(xz) = J(K) — 0. 

We now introduce the decay amplitudes 


(GIT|B") = Aj, UIT[B) = Aj. (49) 


Using the notations of equations (13) and (14), the time-dependent decay rates dN and 
dN of initial flavour states ig and Vg are given by 


2 


—YHt -yL —yHt — eTit 
dN(t) = — n a —Àr ; (50) 
— e Bp eTit __ p eg 1HÍ e~mt 2 
dN (t = A K 

(t) 5 up 5 f (51) 
Using the abbreviated notation: u 
A 

= 11 (52) 


Typ = , 
1 pA; 
and making the approximations [y =T; =T and |g/p| = 1, a straightforward calculation 
gives: 


JA -T|At| 1 - |r)? 2Im (r) 
dN(At) = —— 1 Amg,At) — sin(Amg At)! , 
(At) inm: + 13 P cos(Am,At) 14 InP sin( Amg At) 
IAs? ira 1- [ng 2Im (ry) . > 
dN(At) = —3 € [PAt| 1— IF [r cos(Am,At) + I+ Ir; sin(Am,At) . (53) 


For the case f = J/wK3, this gives, with a precision of better than 1% (Nir 2001): 
rp =e? (54) 


and we obtain, with the proper normalisation of the decay amplitude Aj: 


Te TIAd 
dN(At) — 5 [1 — sin(28) sin(AmgAd)] , 
— Te-TlAtl 
dN(At) = 5 [1 + sin(28) sin(AmaAt)] , 
aj(At) = TN dx = sin(28) sin(AmaAt). (55) 


The integral of a; over all At is zero. Therefore, a time-dependent measurement. of 
the two rates is necessary in order to obtain sin23. The experimental determination of 
sin 26 requires eight steps: 
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1. Reconstruct J/: mesons in their ete~ and utu decays. 


2. Reconstruct neutral K mesons in their rt~ decays. 


3. Reconstruct neutral B mesons from the J/y and K candidates. 


4. Partially reconstruct the second 
"tags" the flavour of the first B 


B meson of the pair in a flavour-specific way. This 
at the time of decay of the second B. 


5. Reconstruct the z coordinates of the two decay vertices to obtain Af for each event. 


6. Determine the fraction w of wrong flavour tags. 


7. Determine the vertex resolution function R(At — At) between reconstructed and 


true At values 


8. Determine sin 28 from a fit to the two time distributions dN,,(At) and dN,(Af). 
(Here dN,, = (1 — w) dN + wdN and similarly for dÑ „). 


In the BABAR experiment (Aubert 2001a) with a sample of about 32 Million BB 


pairs, there are 380 reconstructed and 


tagged J/V K9 events, including about 15% with a 


K — ns? decay. Steps 1, 2, and 3 have already been discussed in the lifetime section. In 
addition to J/Y K}, BABAR has used other charmonium CP eigenstates in their sin 28 
analysis, v(2S)K9 and Xa K9 with ny = —1, J/pK? with ny = +1, as well as J/ K9* 
with K9* — K$x° which is a mixture of CP eigenstates with ny = —1 and +1, but with 


a well-determined effective value of np 


Events / 2.5 MeV/c ? 


728 events 
purity 9496 


Beam-Energy Substituted Mass (MeVic 


Figure 23. Mass distribution for all B 
decays into CP eigenstates with a K} in 
the BABAR sin28 measurement. The 
dotted curve shows the background. 


= +0.65 + 0.07 (Aubert 2001e). 
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Figure 24. Distribution of AE*, see tert, 
for the B —^ Jf K0 candidates in the 
BABAR sin28 measurement. 


The modes with n; = +1 have ry = —e??, and an asymmetry a; with the opposite 
sign. The mes distribution of all events in modes with a K? is shown in Figure 23. The 


signal of the J/ K$ mode is shown in 


Figure 24. The EMC and IFR measure the direc- 


tions of K? mesons, but not their energies. Using the m(B?) requirement to constrain the 
K? energy, leaves only one kinematic constraint, AE*—E*(J/)d-E*(K2)—m(Y4S)/2, 
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which is shown in Figure 24. The background in the Kz sample is much higher than in 
the Ks sample. Background levels are often expressed as purities: 


Naignai 


P = ———— 
Nsignal + Noackground 


(56) 


The event numbers and purities in Figures 23 and 24 are those before flavour-tagging. 


Flavour-tagging is achieved by requiring a second B decay vertex with high momentum 
electrons, high momentum muons, charged Kaons, or other flavour-specific combinations 
determined using a neural net algorithm. In order to maximize the tagging rate, tags with 
only one track are accepted, and their decay coordinate z2 is given by the intersection 
of this track with an assumed B flight line. The neural net tags use poorly identified 
leptons, slow pions, and fast hadrons. They are grouped into two classes NT1 and NT2. 
Table 4 gives the tagging efficiency ¢,, the mistag fraction w;, and the quality factor Q; 


Tag category ^ e; (96) wi (96) Q; (96) 


Leptons 10.9403 891.3 7.44+0.5 
Kaons 35.8+0.5 17.6410 15.0+40.9 
NTI 7.8+0.3 220421 2.50.4 
NT2 13.8403 35.1+19 1240.3 
Sum 68.4 + 0.7 26.1 + 1.2 


Table 4. Flavour-tagging in the BABAR experiment. 


for each tag category i. The quality factors are defined as 
Qi = 6 (1—2u,)’, Q = Di. (57) 


They determine the statistical error of the fit result: 

1.9 

-m (58) 
NaPQ 


where Na, is the number of signal events before tagging. Table 5 gives the number of 
tagged events and their purity for the BABAR experiment (Aubert 2001a). 


e(sin28) = 


Step 5, the determination of A£ with equation (4), is the same as in the lifetime and 
oscillation measurements. Step 6, the determination of the mistag fractions, uses the same 
sample of fully reconstructed neutral B mesons with identified flavour as in the oscillation 
measurement. The 9400 events of this "flavour sample" are tagged with the same four 
tagging categories as the “CP sample". Since their flavour is known, the tags are easily 
classified as right or wrong. The results are given in Table 4. 

Step 7, the determination of the time resolution, is achieved by simultaneously fitting 
parametrized resolution functions R;(A£ — At), which are different for cach tagging cat- 
egory i and each decay mode, to both the flavour sample and the CP sample. This is 
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Mode N P(9) 
J/pKY(ntn-) 316 98 
J[éK9(n9n9) — 64 94 


v(2S)K9 67 98 
Xa K$ 33 97 
J/y k*? 50 74 
J/é Kj 273 5l 
Sum 803 80 


Table 5. Number and purity of tagged events in the BABAR experiment. 
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Figure 25. (a) and (b) show the time dependences for the flavour sample with unmized 
(solid curves) and mixed events (dashed curves). (c) and (d) show these curves for the CP 
sample with B? tags (solid) and B? tags (dashed). (a) and (c) are without At smearing 
and without mistags, (b) and (d) with smearing and mistags. 
illustrated in Figure 25. For each i, the observed Aé distributions can be described by 
dN,((A) = feria + Di cos(AmaAt)] Rj(AE— At) dAt, 
dN,(Ai) = f e T4411 — D; co(AmaAt)] (AE — At) dAt, (59) 


for the “unmixed” flavour sample events dN, with the opposite-flavour tag, and the 
“mixed” events dN; with the same-flavour tag, and by 


dN;(At) = fe Till + Disin 28 si(Ama At) R(AL- At) dAt, 


dN,(At) = f e "Alt — D; sin 28 sin(Am,At)|Ri(At — At)dAt, (60) 
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Figure 26. Rates and asymmetry for the K} modes in the BABAR measurement 


for the CP sample events with a B" and a B? tag, dN3 and dN4, respectively. The 
*dilution" factors are D; — 1 — 2wj. Since the resolution functions R; are the same for all 
four distributions, the fits to the high statistics samples 1 and 2, fix their shapes in the 
low statistics samples 3 and 4. 


This combined fit in step 7 is in practice performed together with steps 6 and 8, which 
determine the mistag fractions w;, and the value of sin 28. The results are 


sin28 = 0.56 x 0.15 for the Ks modes, 
= 0.70 0.34 for J/YK,, 
0.82 + 1.00 for J/pK, 


where the errors are statistical only. A combination of these results leads to 
sin 28 = 0.59 + 0.14 + 0.05. (61) 


The systematic error is dominated by the knowledge of the resolution function for Ks 
modes, and by the knowledge of the CP properties of the background for the K; mode. 
The rates and the asymmetry a; for the Ks modes are shown in Figure 26. If the condition 
jrs] = 1 is not imposed in the fit, the data can be fitted to the expressions in equations 
(53). This fit, using all modes, gives: 


rj] = 0.93 + 0.09 + 0.03 . (62) 


Hence there is no indication for direct CP violation in the decay modes used for the sin 28 
measurement. 


In the Belle experiment (Abe 2001), 31M BB events are analysed in a very similar way. 
The same CP eigenmodes are reconstructed as in BABAR, plus an additional mode, rj Ks, 
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Figure 27. Signal and background in the J/WK? sample of Belle's sin23 measurement. 
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Figure 28. The sin2 result of Belle. (a) Rates for all tagged events; B? tags for 
nf = —1 modes are combined with B? tags for yy = +1 and vice versa. (b) Asymmetry 
ay for modes with np = —1, (c) with ny = +1, (d) combined asymmetry. 


which has n; = —1. Figure 27 shows Belle's signal for the mode J/wA?, whose purity of 
61% is slightly better than BABAR. Another difference is in the tagging strategy, where 
Belle tries to tag each event, and uses all 1137 reconstructed events of their CP sample in 
the final fit. Their “tagging efficiency" ise = 1 compared to e = 0.68 in BABAR, but their 
overall tagging quality is Q = 0.270 + 0.012, which is very similar to Q = 0.261 + 0.012 
in BABAR. Belles’s final fit, combined all modes, is shown in Figure 28. Their result is 


sin 26, = 0.99 + 0.14 + 0.06 , (63) 


where ¢; is the Japanese name for f. 


B-meson factory results 369 


5 +041 
dii us 
+0.82 
0.84", ,, t0.16 
+043 + 0.07 
Omas BRS east 
(0 Mors 0.12+ 0.37+0.09 
saka, 
+0.32 + 0.09 
(11.2M SBS 0-58-94 -0.10 
Q3M BB) 0.34+ 0.20+0.05 
e1 3M Bus 0.99+ 0.14+0.06 
2M BB) 0.59+ 0.14+0.05 
pees s 0.79+ 0.10 
expt. 
-0.5 


sin29, 


Figure 29. Compilation and average of sin 20 measurements. 
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Figure 30. Fit result from Figure 22 and lines of constant Figure 31. Final fit 
likelihood (maz, X:1o, +20) for sin 28. to all observations in- 
cluding sin 28. 


Figure 29 compiles all results for sin 28. The world average is 
sin 28 = 0.79 + 0.10, (64) 


which is in very good agreement with the p/ņ region favoured by the measurements of 
\Vasl, |Via], and e(K?), as shown in Figure 30. A common fit to |Vis|, |Vial, «€(&9?), and 
sin 28 gives the region in Figure 31. With one-standard-deviation errors, this region has 


= d03-E012. 
= 0.44 0.06, 
(2.8 + 0.6)x 107 . (65) 


V EOS 
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Figure 32. Tree (a) and penguin (b) diagram for xt and w^ decays of B mesons. 


BABAR has also performed a first measurement (Aubert 2001g) of the angle œ in 
the Unitarity Triangle (see Figure 19 for its definition). sin 2a is obtained from the time 
dependence of tagged ata” decays. If only the tree diagram of Figure 32a contributes to 
these decays, we would have in complete analogy to Equations 54 and 55 


Tatr- = e^ ; 
LE | = 1, 
artz- = sin2asin(AmgAt), (66) 


where a,4,- is the asymmetry between dN, the number of rta decays with B tags, 
and dN, the number with B^ tags. However, there is a contribution from the penguin 
diagram in Figure 32b, which has a different weak and a different strong phase. This leads 
to direct CP violation, |r,+,-| Æ 1, in addition to CP violation in the interference between 
oscillations and decays. This means Im (r,+,-) Z sin 2a, and instead of equations (55), we 
have to use equations (53) for the expected time dependence, leading to the asymmetry: 


arta- (A) = —C,+,- cos(AmgAt) + Srtr- sin(Am4At), 
_ 1- Ires - |? 
Cr TE re E 
2 - 
Sau = 2m (T+) = sin (2an). (67) 


14 [ra]? 


Using the same 32 M BB events as in their sin 20 analysis, BABAR finds the preliminary 
result 


Cu. = -025705-0.4, 
S, = 003909011. (68) 


There have been theoretical attempts (Beneke 2001, Luo 2001) to relate sin(2o,g) to 
sin(2a), using additional measurements of Kr and rr decay rates of neutral B inesons. 
The central value of the BABAR result, sin(2a.q¢) = 0.03, gives aeg = 1? or 89°, which 
they translate into o close to zero or equal to 75°. The first value does not agree with 
the preferred p/1) region from all our previous discussions, but the second one agrees very 
well. Using the one-standard-deviation errors of the BABAR result in Equation 68 and 
the aeg to a translation curve in (Luo 2001): 


a = (75 + 18)°. (69) 


This result is drawn with a central and two one-standard-deviation circles in Figure 33. 
There are no 20 contours because the BABAR result still allows any value of a within 
two standard deviations. 
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Figure 33. Contours of constant likelihood (maz, +10), from the BABAR sin2oq 
measurement. 


6 Conclusion 


CP violation in the B meson sector has been firmly established in 2001 by two experi- 
ments, BABAR at SLAC and Belle at KEK. Their observation of ay Æ 0 in charmonium 
final states, has a combined significance of more than seven standard deviations. It is com- 
patible with the Standard Model explanation of CP violation in the Higgs mechanism. It 
is also compatible with the CKM matrix parameter values of p and 7 as determined from 
measurements of |Vio|, |Via|, and «(K). 


With the combined efforts of theorists and experimentalists, the next years will show 
how precisely the measurements will determine p and 7, and if the five independent con- 
straints on p and 7 from |V,s|, |Vi4|, (K), 8, and o will establish a consistent description, 
or will falsify the assumption that all weak processes including CP violation are well 
described by the Standard Model. 


Acknowledgments 


I thank K Peach and all his co-organizers of SUSSP55 for inviting me to this excellent 
School and for their friendly hospitality. 


References 


Abashian A et al (Belle), 2001, "The Belle Detector", submitted to Nucl Instr Meth A, KEK 
Progress Report 2000-4 

Abbiendi G et al (OPAL), 1999, Eur Phys J C, 12 609 

Abe K et al (Belle), 2001, Phys Rev Lett 87 091802 

Ackerstaff K et al (OPAL), 1998, Eur Phys J C, 5 379 

Albrecht H et al (ARGUS), 1987, Phys Lett B 192 245 

Andrews D et al (CLEO), 1980, Phys Rev Lett 45 219 

Aubert B et al (BABAR), 2001a, Phys Rev Lett 87 091801 

Aubert B et al (BABAR), 2001b, Phys Rev Lett 87 201803 


372 Klaus R Schubert 


Aubert B et al (BABAR), 2001c, submitted to Phys Rev Lett, hep-ex/0112044 
Aubert B et al (BABAR), 2001d, submitted to Phys Rev Lett, hep-ex/0112045 
Aubert B et al (BABAR), 2001e, Phys Rev Lett 87 241801 

Aubert B et al (BABAR), 2001f, submitted to Nucl Instr Meth, hep-ex/0105044 
Aubert B et al (BABAR), 2001g, submitted to Phys Rev Lett, hep-ex/0110062 
Barate R et al (ALEPH), 2001, Eur Phys J C 20 431 

Bebek C et al (CLEO), 1981, Phys Rev Lett 46 84 

Beneke M, Buchalla G, Neubert M and Sachrajda C T, 2001, Nucl Phys B 606 245 
Berger Ch et al (PLUTO), 1978, Phys Lett 76 243 

Bienlein J K et al (DHHM), 1978, Phys Lett 78 360 

Bjorken J D, 1987, presented at the Workshop on Experiments for the Supercollider, Berkeley 
Branco G C, Lavoura L and Silva J P, 1999, “CP Violation", Oxford University Press 
Brandt T (BABAR), 2001, presented at 9th Int Symp on Heavy Flavour Physics, Pasadena 
Buchalla G, 2001, these Proceedings 

Cabibbo N, 1963, Phys Rev Lett 10 531 

Christenson J H, Cronin J W, Fitch V L and Turlay R, 1964, Phys Rev Lett 13 138 
Darden C W et al (DASP-2), 1978a, Phys Lett B T6 246 

Darden C W et al (DASP-2), 1978b, Phys Lett B 78 364 

Eichler R, Nakada T, Schubert K R, Weseler S and Wille K, 1986, SIN-PR-86-13 
Ellis J R, Gaillard M K, Nanopoulos D V and Rudaz S, 1977, Nucl Phys B 131 285 
Finocchiaro G et al (CUSB), 1980, Phys Rev Lett 45 222 

Hamel de Montchanault G , 2001, Proc Europhys Conf High Energy Phys, Budapest 
Herb S W et al, 1977, Phys Rev Lett 39 252 

Hitlin D et al (SLAC, LBL, Caltech), 1989, SLAC-352 

Jarlskog C, 1985, Phys Rev Lett 55 1039 

Hitlin D et al (SLAC, LBL, Caltech), 1989, SLAC-352 

Jaffe D E et al (CLEO), 2001, Phys Rev Lett 86 5000 

KEK Laboratory, 1990, Task Force Reports KEK-90-23 and KEK-90-24 
Kleinknecht K, 2001, these Proceedings 

Kobayashi M and Maskawa T, 1973, Progr Theor Phys 49 652 

Luo Z and Rosner J L, submitted to Phys Rev D, hep-ph/0108024 

Nir J, 2001, these Proceedings 

PDG (Particle Data Group), 2000, Eur Phys J C 18 1 

Rosner J L, 2001, these Proceedings 

Spencer L J et al (CUSB), 1981, Phys Rev Lett 47 771 

Stark J (BABAR), 2001, Ph D Thesis, Université Paris VI, in French 

Wacker K et al (PSI), 1988, PSI-PR-88-09 

Wolfenstein L, 1985, Phys Rev Lett 51 1945 


373 


B physics at the LHC 


Tatsuya Nakada 


CERN and University of Lausanne, Switzerland 


DOI: 10.1201/9780429187056-10 


1 Introduction 


Since its discovery, the b quark has brought us two big surprises. The first was the un- 
expectedly large lifetime. The second was the mass difference between the two mass 
eigenstates of the B4 meson system which is about 100 times larger than the mass dif- 
ference in the neutral K meson system. From the first observation, we learned that the 
mixing between the second and third families is much smaller than between the first and 
second families. (The mixing between the first and third families is even more suppressed.) 
The second observation taught us that the mass of the top quark is much larger than had 
been anticipated at that time. 


The main goal of B physics is to study the structure of quark mixing and its role in 
CP violation. In the Standard Model CP violation can be very naturally accommodated 
through the complex quark mixing matrix (Kobayashi and Maskawa 1972) defined by four 
parameters (Nir 2001). All current observations of CP violating phenomena in particle 
physics are in full agreement with the Standard Model calculations. However, there are 
still some reasons to speculate about CP violation generated by physics beyond the Stan- 
dard Model. Firstly, the Standard Model alone cannot account for the large asymmetry 
between matter and antimatter observed in our universe (Shaposhnikov 1986). Secondly. 
various extensions to the Standard Model introduce new sources of CP violation. Since 
CP violation is expected in many B meson decay modes, and the Standard Model can 
make precise predictions for some of these decay modes, the B meson system appears to 
be a very attractive place to look for evidence for physics beyond the Standard Model. 


In the presence of new physics, some assumptions made to extract the Cabibbo- 
Kobayashi-Maskawa (CKM) parameters are no longer valid. Indeed, the consistency of 
the CKM picture with current observations could be accidentally due to numerical can- 
cellation between various effects from new physics. Therefore, it is essential to develop a 
strategy which allows the new physics and the Standard Model contributions to be clearly 
disentangled. Then the CKM parameters can be determined in à model-independent way. 
This is important since we hope that physics at higher energy scale will one day be able 
to explain the family structure, by deriving the CKM parameters. 
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In this article, we discuss how B meson decays can be explored at the LHC in order 
to obtain a better understanding of the CKM picture, and to look for physics beyond the 
Standard Model. 


2 The CKM picture 


2.1 The CKM matrix 
In the Standard Model, CP violation is naturally introduced by a 3 x 3 complex quark 
mixing matrix, Vckw, expressed as 

Vua Vus Vab 

Vekm = | Vea Ves Veo 

Va Vis Ve 

The charged current of the weak interaction is then proportional to 
U, — 55) Vig DLW" 

where U; and D, are the left-handed quark operators for the charge 2/3 up-type and the 
charge —1/3 down-type quarks respectively: 


UL d; 
Ur = {ci tf; D, —l8L 
ty br 


The matrix Vckw is unitary, VIV = 1. One of the unitarity relations is: 
Vaa Va + Voa Va + Via Vis = 0. (1) 


This is illustrated in the complex plane in Figure 1(a). The unitarity condition can be 
illustrated more easily by translating V,4V,? and VV, to form a closed triangle, as shown 
in Figure 1(b). The three angles of the triangle, a, 8 and y (also known as $», $1 and $3 
respectively) can be defined as: 


V Vx roy a p* 
a = tan! C i i) B =r — tan`! KS) ; y= tan”! L s) . (2) 


7 * j? * 
Vaa Vib Voa Vab 


7 q7% 
Veal cb 


a) b) 


Figure 1. (a) One CKM matriz unitarity condition: VuaV, + VeaVa + Vag = 0. 
(b) Elements translated to form a triangle drawn in the complex plane. 
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It should be noted that a redefinition of the quark phases results in a rotation of the 
triangle in which the three angles, a, 8 and y remain invariant. 


Violation of the unitarity condition given by Equation (1) is expressed in a graphical 
form in Figure 2(a) and (b), where the three sides given by VuaV,4, Vaa Vá and Vi4V, do 
not form a closed triangle. Note that a, 8 and y defined by Equation (2) still satisfy: 


a+Bp+y=n. 


If one forms the closed triangle from the length of the three sides, |Vaa Vä], |WeaV.i| and 
|V Vz |, the three angles of this triangle, o^, 6’ and 4' defined in Figure 2(c), are not 
identical to a, 6 and y. Therefore, a test of unitarity can be made by comparing the 
angles defined by the length of the three sides, and the angles measured by CP violating 
effects (as explained later). 


b) 


c) 


| Vod Vub | 


[Vea Veo | 


Figure 2. (a) and (b), are similar to Figure 1, but unitarity is violated. In (c) a closed 
triangle is formed using the three sides. 


A unitary 3 x 3 matrix can be parameterized by four parameters. One possible choice 
(Particle Data Group 2000) is to use three angles, 612, 923, 63 and one complex phase, 6. 
The standard parameterization for the CKM matrix is then given by: 


VckM = Rz X Rig X Rye 


where 
C2 813 0 1 0 0 C13 0 size t? 
Ri2 = | —sı2 c 0 Ro = | 0 — €» 52 R = 0 1 0 
0 0 1 0 —$93 C23 — 5379 0 C13 


with si; = sin 8j; and ci; = cos bij. 
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A parameterization reflecting the observed pattern of the CKM matrix was proposed 
by Wolfenstein (Wolfenstein 1983). This introduces the parameters: 
813 COS Ô S13 Sin Ó 


$23 
A = 812, A=> p=——_, n= ; 
$15 $12823 512823 


and expands the matrix elements in powers of A. Neglecting terms proportional to A” 
where n > 5, this gives: 


1- X2 A AX (p — in) 
Vorm © | A — i A208 1— 2 AM (3) 
AM(1—p—i}) —-AM -iAMm 1 


where p and fj are given by p = p(1 — A?/2) and 5j = n(1 — A?/2). The parameter A is 
known from light hadron decays to be 0.221 + 0.002 (Particle Data Group, 2000). As seen 
from Equation 3, the first 2 x 2 sub-matrix is almost unitary: 


VaV + Vas Va = iA? dn = 0 VaVe + VaVa = iA ~ 0 


and 


IVa + |Vas =1—A*/4~1 IVa? + |Ves|? = 1— M/AmI 


7 
Vis 


With the parameterization given in Equation 3, the imaginary part of V.q becomes negligi- 
ble in the unitarity relation given by Equation (1), and the phases of the matrix elements 
are: 

arg Via = —B, arg Vib = —7; 


arg Vig = arg Væ = arg Va = 0, arg Vig = T. 


Vas and V,, are also real, but the imaginary part of V,, cannot be completely ignored: 
arg Vis = OY +7 
The angles 8, y and dy are functions of p, 7 and A: 


B —tan! I y= tan! n ôy = tan^! A?n. (4) 
1-p p 


2.2 Extraction of the CKM parameters 


From a measurement of |V,,|, the Wolfenstein parameter A can be determined. The 
matrix element |V,4| is extracted from semileptonic (and hadronic) decays of B mesons 
into charmed meson final states (Stone 2001). In exclusive B meson decays, a description 
based on the quark tree level process b — c+W - is obscured by soft hadronic interactions. 
This has to be taken into account in order to extract |Vi4| from the data. Significant 
improvements in understanding these hadronic effects have been made both in theory 
and experiment. Further progress will come using the high statistics data samples from 
the BABAR and BELLE experiments (Schubert 2001). 


The semileptonic decays of B mesons to final states containing only the light u and 
d quarks are used for the extraction of |V,,| (Stone 2001). They are generated by the 
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tree level process b + u+ W^. Unlike the determination of |V|, hadronic decays cannot 
be used since sizeable contributions from the penguin processes, b > s and b — d, are 
present in the decay amplitudes. It is much more difficult to evaluate the effects of strong 
interactions for |V,»| than for |V.|, and the error on the current value of |Vij| is totally 
dominated by the theoretical uncertainties. This will remain the case for some time. 


While |V4| and |V;j| are determined from decays with tree processes, |Vig} can be 
accessed indirectly through loop processes such as B?- B oscillations. The oscillation in 
the Standard Model is described by the well known box diagrams shown in Figure 3. Due 
to the large top quark mass and the structure of the CKM matrix, only the top quark 


———————————— Orr 


"ow ift dw f ho: V ju d 
d Tt | b d S w sI b 
a —————À——— «* ———————— (000 ———— ---eeMeec-- 

B? B? B? B? 


Figure 3. Bor diagrams describing B? -B? oscillation. 


contribution needs to be considered in the loop. Neglecting the absorptive part of the box 
diagrams, the oscillation amplitude for B? — B? is calculated to be (Hagelin 1981): 


2 £2 2 
Gris, Bp,mg,my, 


Ha = — 1272 


"na, S (E) (VV )? (5) 


where Gr, mw and mg, are the Fermi coupling constant, W boson mass and Bg mass. 
The function S(zx,) is determined from the mass ratio, z, = (m:/my)?, where m, is the 
top quark mass, and the QCD correction factor, ng, can be calculated reliably with per- 
turbation methods. The B meson decay constant, fg,, and the bag parameter, Bg,, have 
never been measured directly. The decay constant fp, is given by the transition matrix 
element between B? and the hadronic vacuum state. Thus it could be experimentally ob- 
tained from the branching fraction for the leptonic decay, B+ — r^v, shown in Figure 4, 
assuming that |V,»| is known. The decay constants for B* and B? are expected to be very 


Figure 4. Diagram for the Bt + r*v, decay. 


similar. The bag parameter takes into account the difference between the hadronic vac- 
uum and the actual hadronic states virtually present in the B°-B° oscillation processes. 
Theoretical estimates exist with large uncertainties due to the difficulties in evaluating 
the effects of the non-perturbative soft hadronic interactions. The most promising theo- 
retical approach to obtain fg, and Bp, is the QCD lattice calculation discussed in these 
proceedings (Davies 2001). 


The absorptive part in the B°-B® oscillation is due to c and u quarks. It is calculated 
to be very small compared to the dispersive part and can indeed be ignored. With this 
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approximation, the mass difference between the two mass eigenstates, BẸ and BŁ, can 
be derived, 
— „H L 
Ama E Mp, — MB; = 2| Hail, 


and the decay width difference between them is negligible: 
_ H 
AT, TL -TÉ 
There is no CP violation in the B°-B® oscillations in this approximation. Note that the 


phase of the oscillation amplitude is given by the phase V = —2f. 


The mass difference Am, is experimentally measured as the frequency of B°- B? os- 
cillations. From the known structure of the CKM matrix, we have |V»| = 1 with very 
high accuracy. Therefore, |Vi4| can be obtained from Ama. The error on |V] is totally 
dominated by the theoretical uncertainties on fgv Ba. 


From the measurements of Vas, Væ and Vig, the quantities: 
1|Vial X 
pr + 1-— 6 
Ve + iP = + Val 5 (6) 


1 [Veal 


(AP +i = ipt (7) 


and 


can be drawn as circles around (0, 0) and (1, 0) in the -ñ plane respectively. There are 
two possible solutions where these circles intersect, but the one with 7 > 0 is favoured 
from the analysis of CP violation in the K°-K° oscillations. 


The mass difference in the B, system, Am,, can be evaluated by calculating the 
B9-B? oscillation amplitude using box diagrams in a similar way to the By system. A 
measurement of Am, determines |V; V; |7, which allows: 


(8) 
to be used instead of Equation (7). In the ratio: 


_ 1JA , Vn B,T]B, x VBs.fs. 
A [Ves] A Ama "V TB, T]B, * JBoifo, 


the assumption |V;,| = 1 is no longer necessary. While the theoretical uncertainty in 
fp vB is quite considerable, the ratio between these quantities for the B, and By mesons 
is theoretically much better understood. Therefore, Equation (8) will have a significantly 
smaller error than Equation (7). Unfortunately, only an experimental lower limit is avail- 
able for Am, at the moment. 


We now consider CP violation for decay final states which can be produced by both 

? and B°. These can be CP eigenstates such as J/1? Kg or non-CP eigenstates such 

as D**n- and its CP-conjugate state D*-a+. If CP violation is present neither in the 

oscillation nor the decay amplitudes, it is well known that the only signature of CP 
violation that can appear is through phases: 


sin(óo + o;) sin (Amat) 
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in the time-dependent decay rates of initial B? and B?, where œo is the phase of the 
B°—B? oscillation amplitude, and @, is the phase of the ratio of the instantaneous decay 
amplitudes of B? and B? into the particular final state. 


For the J/*) Ks final state, the b + c+ W~ tree process dominates, but there can also 
be a b — s penguin process (Figure 5). The phase of the tree process is given by Vis, 
and is 0 in the Wolfenstein approximation. The penguin process is described by a virtual 
top quark loop with CKM matrix elements, Vj, V,?, which has only a small phase —dy. 
Therefore, à; = 0, with a theoretical uncertainty less than a few percent. 


Figure 5. Quark diagrams for the B? — J/w Ks decay. 


The final state D**7- can be generated by the tree process b — c + W- , followed by 
W- — ud for the B? decay or the doubly Cabibbo-suppressed tree process b — 7 + Wt 
followed by Wt — cd for the B" decay, as seen from Figure 6. From the relevant CKM 
matrix elements, it follows that ó; = y. Note that only the tree processes contribute. 


u d 
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Figure 6. Quark diagrams for the B? and B? + D**z- decays. 


The phase of the oscillation amplitude is given by $9 = 28 as discussed before. There- 
fore the CP violation signatures in B? and B? decays are 


sin(2@) sin (Amat) for J/ Ks 


and 
sin(28 + y)sin (Amat) for D**z- and D*^z*. 


For the sin(28) measurement from J/ Ks, the average of the recent results from 
BABAR and BELLE gives (Schubert 2001): 


sin 23 = 0.79 + 0.17 


where the error is scaled following the recipe of the Particle Data Group. It is unlikely 
that any definitive measurement of sin(28 + y), using B > D(**)5(?, will be done by 
the current generation of experiments. 
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Analogous decays for the B? meson system are J/v ¢ and J/v n, which have @; = 0, 
and D+K~ which has ¢; = y. Since the B9-B? oscillation phase is —2ó^ as discussed 
before, the CP violation signatures would be: 


sin(—2ôy) sin (Amst) for J/v 4 (or J/v n) 


and 

sin(—26y + y) sin (Amst) for DİK” and D; Kt. 
No definitive measurements on these quantities will be available from the current gener- 
ation of experiments. 


In Figure 7, we summarise the region of p and 7 allowed by the current data on 
|Vas|/ (A|Vas]), Ama, and sin(28), as well as the one-sided constraint on Amg/Am, from 
the lower limit on Am,. All the data are interpreted within the framework of the Standard 
Model, so the constraints may be modified by new physics as discussed earlier. There is a 
common overlapping region indicating that the data are consistent with the CKM picture, 
and no sign of new physics is visible within the errors. Note that the sin(28) measurement 
gives two allowed regions in the -ñj plane defined in Figure 7, and only the one consistent 
with the other constraints is drawn. 


1 
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Figure 7. Allowed region in the p- plane from |V.s|/|Vzs|, Ama, and sin(28) measure- 
ments, and from the upper limit on Am,. 


3 Possible CKM picture in 2006 


Let us now speculate about the CKM landscape in 2006. Due to progress in theoretical 
understanding of the hadronic effect, strongly helped by the large statistics and high 
quality data from BABAR and BELLE, |Vi./V.s| might become known with a relative 
error of +10%. Once Am, is measured by CDF, |Vial/(A]Vis|) will become known with a 
relative error of +7%. From CP violation in B? and B? decays to J/ Ks, the combined 
results from BABAR, BELLE, CDF and DO might yield a result of sin(28) with an error 
as small as 0.03, which is still statistically limited. Estimating the progress in other CP 
violation channels is more difficult since they have various theoretical uncertainties, and 
future improvements are difficult to evaluate. Assuming that the currently preferred values 
of p and 7 remain unchanged, the situation expected in 2006 is illustrated in Figure 8. 
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Figure 8. Possible situation of the allowed region in the p-1ij plane in 2006 


4 Presence of New Physics 


Once we allow the presence of physics beyond the Standard Model, the determination of 
the CKM matrix elements becomes much more complicated. Most of the extensions of 
the Standard Model introduce new heavy particles. The contributions of these particles 
to the tree processes should be negligible, but for B°-B° and B9-B? oscillations, their 
contribution to the box processes could be sizeable. New heavy particles could also appear 
in the penguin processes, but the effects should be smaller than in the box processes, due 
to the different dependence on the masses of the particles appearing in the loops. 


In the presence of new physics, the phases for the oscillation amplitudes are modified: 
po = 28 + ONp 
and 
dio -257 + bp 
where $4, and ó&p are new phases. 


The new physics has very little effect on the decay amplitude if this is generated by 
tree processes. Therefore, we assume that the phases of the amplitudes for decay modes 
such as B? + J/v Ks, B® — D**z-, B? + J/¢ and B? — D} K- remain unchanged. 
For the CP violation signatures in B? and B" decays, we would then have 


sin(28 + ¢4p) sin Amat for J/y Ks (9) 


and 
sin(28 + 6, + y) sin Amat for D**z- and D*^z*. 


The value of 8 is modified, but it is possible to use both measurements to extract the 
angle, y, in a theoretically clean way, even if new physics is present. 


Similarly for the B? system: 
sin(—2óy + óXp) sin (Amst) for J/ à (or J/v ) (10) 


and 
sin(—26y + ó&p + y)sin(Am,t) for DİK” and D; K*. 
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the value of dy is modified, but y can be cleanly extracted by combining the two CP 
violation measurements. 


Once y is determined, p and 7 are given by: 
PREL p= tiel 
À |Veo| è 
and the unmodified values of 8 and ôy can be obtained from Equation (4). Hence all the 


parameters of the CKM matrix can be completely determined even if new physics exists. 
The contributions from new physics are then determined from Equations (9) and (10). 


The allowed region in -7j plane from the CP asymmetry measurement with B" and 
B® — J/w Ks decays is no longer valid. What is measured from the asymmetry is not Ø, 
but 8 + ¢$p/2. Similarly, the region given by the Amg/Am, measurements is no longer 
valid since there could be sizeable contributions to both the Am from new physics. The 
only valid region in Figure 8 is that from the |V,,5|/(A|V.5|) measurement, and the apparent 
consistency seen in Figure 8 could be accidental. Once y is measured in the theoretically 
clean way explained above, the situation will become clear. Figure 9 illustrates a possible 
situation where y is measured with an accuracy of +5°, and is not consistent with the 
mixing and CP asymmetry constraints. In this case, the measurements of y reveal the 
presence of new physics contributions. 
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Figure 9. Possible situation in the p-i plane in 2006 in the presence of new physics 
contributions 


In the discussion above, we have made an assumption that new physics does not 
contribute to the decay amplitude. This may not be strictly true for the B? > J/y Ks 
and B? > J/) ¢ decays, since these decays receive a small contribution from the b > s 
penguin process. Whether new physics can appear in the penguin process can be examined 
by studying CP violation in B? and B? decays into 6 Ks. In the Standard Model the decay 
amplitude is dominated by a loop with the virtual top quark, so the phase is given by Vis, 
and the CP violation signature is: 


sin(28 + dip — 26* + op) sin (Amat) , (11) 
where $E, is the additional phase introduced in the b — s penguin process by new physics. 


By comparing Equations (9) and (11), it is possible to determine whether new physics 
contributes significantly to penguin processes. 
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New physics contributions to the b — d penguin processes can be studied from CP 
violation in B? and B? decaying into K*°K*® or ọn? where only the b — d penguin 
process contributes in the Standard Model. Independent studies of b > s and b > d 
penguins can be made with the B, system. 


In conclusion, a theoretically clean and model-independent determination of the pa- 
rameters of the CKM matrix is possible, even if there are new physics contributions to CP 
violation observables. Once the CKM parameters are determined in a model independent 
way, the possible existence of new physics can be cleanly established. 


5 B experiments at the LHC 


5.1 General considerations 


The goal of B physics in the LHC era (Ahmadov et al, 2000) is to determine the CKM 
parameters in a model-independent way and to isolate the effect of new physics so that 
its characteristics can be identified. This calls for an high statistics experunent capable of 
studying CP violation with both B? and B® systems decaying into various hadronic final 
states. 


The production cross section for bb quark pairs at the LHC energy is estimated to 
be approximately 500ub, which is far larger than at existing machines. The fraction of 
events with b quarks, o43/Ginelastic, is about 6 x 107?, which is similar to the fraction of 
charm events in present fixed-target charm experiments. Thus, the LHC appears to be a 
very promising place to perform high precision CP violation measurements in B meson 
decays. At the LHC, B,, Bs, B? and b-baryons are abundantly produced, in addition to 
Bt, B? and B®. 


In order to exploit the full potential of the LHC, the following capabilities are needed. 


1. A trigger sensitive to both leptonic and hadronic final states. 


2. A particle identification system capable of identifying p, K, 7, p and e within the 
required momentum range. 


3. A vertex detector able to reconstruct primary and B vertices very precisely. 
4. A tracking system with good momentum resolution. 


5. An electromagnetic calorimeter capable of reconstructing n° decays. 


5.2 The ATLAS and CMS experiments 


ATLAS (Armstrong 1994) and CMS (Bayatian 1994) are two general purpose collider 
detectors designed to perform high pr physics at the LHC, including studies of the top 
quark, detection of the Higgs boson, and searches for supersymmetric particles. They 
detect bb quark pairs in the central region of the pp interactions. As already demonstrated 
by CDF, such general purpose collider experiments can cleanly reconstruct B meson final 
states containing lepton pairs, such as J/ Ks, J/9 and £+. Both ATLAS and CMS 
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have excellent muon and electron detection capabilities, and a good vertex resolution, 
allowing them to collect a high statistics sample of such decays. However, their first level 
trigger on a high pr lepton is not sensitive to purely hadronic final states. Hadronic decay 
modes can only be triggered by a semileptonic decay of the other b quark, which has a low 
efficiency due to the relatively small branching fraction for the semileptonic decay. The 
two experiments also have no p/K/- separation capability in the relevant momentum 
range, although the energy loss, dE/dx, can be used for particle identification at low 
momenta. In particular they cannot separate kaons from pions in the high momentum 
region relevant for two-body decay modes of B mesons such as 7*- ^ and D,K. 


For these reasons, ATLAS and CMS will not be able to study CP violation with all 
the final states necessary to perform model-independent analysis. 


5.3 The LHCb experiment 


The LHCb experiment is a single arm forward spectrometer, which covers the region of pp 
collisions which is not looked at by the two general purpose detectors. The LHCb detector 
(Amato 1998) is designed to have a trigger that is equally efficient for leptons and hadrons. 
It will be able to exploit the full b physics potential of the LHC at a much lower luminosity 
(2 x 109cm-?s^!) than the nominal luminosity (10°4cm~s~'), and will therefore be able 
to perform its full physics programme from the beginning of LHC operation. 
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Figure 10. The LHCb spectrometer placed in the LHC Intersection-8 experimental area. 
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Figure 11. Polar angles of the b- and b-hadrons 


The detector layout is shown in Figure 10. It resembles a typical fixed target spec- 
trometer. A vertex detector is placed in “Roman pots” around the interaction region. 
It is followed by a tracking system, RICH counters with aerogel and gas radiators, a 
large-gap dipole magnet, a calorimeter system, and a muon system. An existing LEP 
experimental area will be reused to install the detector. The interaction point is shifted 
by 11m from the centre of the experimental hall, in order to accommodate the detector 
elements without extra excavation. 


The choice of a single arm detector geometry is based on the fact that both the b- and 
b-hadrons are predominantly produced in the same forward (or backward) cone at high 
energies. This is demonstrated in Figure 11 where the polar angles of the b- and b-hadrons 
are shown in events generated with the PYTHIA simulation programme. The polar angle 
is defined with respect to the beam axis in the pp center-of-mass system. Detecting both 
b- and b-hadron at the same time is essential for the flavour tag. 


Further advantages of the forward geometry are: 


» The b-hadrons produced are faster than those in the central region. Their average 
momentum is about 80GeV/c, corresponding to a mean decay length of ~ 7mm. 
Therefore, a good decay time resolution can be obtained for reconstructed B-mesons. 


» The spectrometer can be built in an open geometry with an interaction region which 
is not surrounded by all the detector elements. This allows the vertex detector 
system to be built with sensors which can be extracted away from the beam during 
the injection. During data taking, the sensors are positioned closer to the beam in 
order to achieve a good vertex resolution. 


» In the forward region, momenta are mainly carried by the longitudinal components. 
Therefore, the threshold value for the p, trigger can be set low for electrons, muons 
and hadrons without being constrained by the detector requirements. This makes 
the p, trigger more efficient than in the central region. 


» The momentum range required for particle identification is well matched to the 
Ring Imaging Cherenkov Counters, and the required size for the counters remains 
affordable. 


» The open geometry allows easy installation, maintenance and possible upgrades. 
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5.4 The LHCb detector 


Beam pipe 


A large vacuum tank with a length of 1.7m and a diameter of 1m is placed around the 
interaction point to accommodate the vertex detector system with its retraction mechan- 
ics. It has a 2mm Al forward exit window over the full detector acceptance. This part 
is followed by two conical beam pipe sections; the first part is 1.4m long with a 25mrad 
opening angle, and the second part is 16m long with a 10mrad opening angle. Except for 
bellows, flanges and the last 6.3m of the 10mrad cone, the beam pipe is made from Al-Be 
alloy in order to reduce the radiation length. This is essential to minimize the occupancies 
of the tracking and RICH systems, and to improve the detection efficiency for photons 
and electrons. 


Magnet 


A dipole magnet with an Al conductor provides a field integral of 4 Tm. The polarity of 
the field can be changed to reduce systematic errors in the CP-violation measurements that 
could result from a left-right asymmetry of the detector. The two pole faces form a wedge 
shape following the spectrometer acceptance, in order to reduce the power consumption. 


Vertex locator (VELO) 


A total of 27 stations of silicon microstrip detectors are placed perpendicular to the 
beam of which 25 stations are used as a vertex detector system. The remaining two 
stations are used for detecting bunch crossings with more than one pp interaction as à 
part of the Level-0 trigger (pile-up veto counters). All the stations are split into two 
halves, covering left and right 180? sections. Each vertex detector station consists of two 
sensor planes with different strip layouts, one for r and the other for @ measurements. 
The pile-up veto counters have only r measurement sensors. 


The closest distance between the active silicon area and the beam is 8mm. In order 
to cope with the high radiation dose expected so close to the beam, it is planned to 
use n-on-n silicon sensors. The silicon detectors are placed in Roman pots surrounded by 
250pm thick aluminium foil, which acts as a shield against RF pickup from the circulating 
beam bunches. To avoid mechanical collapse, a secondary vacuum is maintained inside 
the Roman pots. During injection and acceleration, the Roman pot system will be moved 
away from the beam to avoid interference with the machine operation and accidental 
irradiation of the detectors. 


For each bunch crossing, i.e. every 25ns, the signals are read-out and stored in analogue 
pipeline buffers. 
Tracking system 


Because of the high particle density close to the beam pipe, the LHCb tracking detector 
is split into inner and outer systems. The boundary between the two is chosen so that 
the occupancy of the outer tracker does not exceed 1596. 


The outer tracking system uses drift chambers based on a straw cell structure. The 
straws are made by winding 5mm diameter carbon-loaded Kapton foil around a central 
anode wire. The drift-time is sampled over 50ns, i.e.every two bunch crossings. 


The inner tracking system is made from single sided p-on-n Si strip detectors with a 
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strip pitch of ~200jm. Since the sensitive regions of the Si sensor are several centimetres 
away from the beam the problem of radiation damage is less severe than for the VELO 
detectors. 


The RICH detectors 


The LHCb detector has two Ring Imaging Cerenkov (RICH) systems, with three 
different radiators, in order to cover the required momentum range, 1-100GeV/c. The 
first RICH uses aerogel and C4F io gas as radiators. The second RICH, uses CF, gas as a 
radiator. It is placed after the magnet, and is responsible for identifying high momentum 
particles. In both RICHs the Cherenkov light is reflected by mirrors and detected with 
planes of Hybrid Photon Detectors (HPD's) placed outside the spectrometer acceptance. 


Calorimeters 


The calorimeter system consists of a preshower detector followed by electromagnetic 
and hadronic calorimeters. It also serves as the initial part of the muon filter system. The 
cells of the preshower detector are made from two scintillator plates sandwiching 14mm- 
thick lead plates. The cell size of the preshower detector is matched to the module size 
of the electromagnetic calorimeter. For the electromagnetic part a Shashlik calorimeter 
is used since a modest energy resolution is required. The hadron calorimeter is based on 
a scintillating tile design similar to that used in the ATLAS experiment. 


Muon detectors 


For the muon stations Resistive Plate Chambers are used in the region where the 
charged particle rate is below 1kHz/cm?. For the region with a charged particle rate from 
1kHz/cm? to 100kHz/cm?, Multi Wire Proportional Chambers (MWPC’s) are used. For 
the small region of the first muon station closed to the beam pipe, where the charged 
particle rate exceeds 100kHz/em?, triple-GEM chambers or MWPC’s with asymmetric 
gas gaps are being considered. 


Trigger 


The LHCb trigger is divided into four decision levels. The Level-0 decision is based on 
high-pr hadrons, electrons or photons found in the calorimeter system, or muons found 
in the muon system. Information on these candidates are sent to the Level-0 Decision 
Unit. The number of primary vertex candidates is determined using the pile-up veto 
counters in the Vertex Locator and also sent to the Level-0 Decision Unit. Based on all 
the information, the Level-0 Decision Unit makes an overall Level-0 decision. Events with 
multiple pp interactions are discarded, since rejection of background is more difficult in 
those events. The Level-0 trigger provides a modest reduction of minimum bias events by 
a factor of ~10. 


In the Level-1 trigger, data from the vertex detector is used to select events with 
multiple vertices. In addition, the high-pr candidates used in Level-0 can be combined 
with the tracks having large impact parameter in the Vertex Locator. This provides a 
further reduction of ~25 for minimum bias events. After a positive decision of the Level-1 
trigger, all the data is read out into an event buffer. 


At Level-2, a further enhancement of events with b-hadrons can be achieved by com- 
bining different detector components, e.g. by adding momentum information from the 
main tracking system to the impact parameter calculation with the Vertex Locator. At 
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Level-3, a further decision is made after fully reconstructing the b decays. 


The LHCb trigger system is designed to cope with the rather small o,5/@inelastic of 
~6 x 10^? at the LHC energies, while still maintaining a high efficiency for events with 
b hadrons. The strategy is to spread the suppression factors evenly and not to rely on 
any single trigger selection, in particular at early levels where available information from 
the detector is limited. This is reflected in the modest suppression factors of 10 and 25 
for the ordinary pp interaction events at Level-0 and Level-1, respectively. Simulations 
of the trigger performance can be relied upon for such a modest suppression. By not 
heavily relaying on a particular selection criterion, the trigger system is flexible and can 
be readjusted to the actual running conditions of the experiment. 


5.5 Performance of the LHCb detector 


The benefit of having particle identification and good invariant mass resolution can be 
demonstrated by reconstructing B, > D} K^ decays. The main background to this decay 
mode comes from the B, —^ D{n~ decays which are used to study B.-B, oscillations. 
Compared to this decay mode, the branching fraction of B, + D} K7 is suppressed by a 
factor of 1/A? zz 20. The two decay modes have an identical decay topology, and can only 
be distinguished by invariant mass and particle identification. It should be noted that the 
momenta of the K^ and v^ from these decay modes are large since they are two-body 
decays. 
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Figure 12. The LHCb simulation results for the reconstructed invariant mass distribu- 
tions for D! K^ combinations without and with particle identification using RICH. 


Figure 12 shows the reconstructed invariant mass distributions expected with the 
LHCb detector for D* K^ combinations without and with particle identification using 
RICH. The combination of the good mass resolution, c = 11MeV/c’, and RICH par- 
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ticle identification, mean that the background from B, — Dia decays can be almost 
completely removed in the reconstructed Dt K^ sample. It should be noted that no CP 
violation effect is expected in the background decay mode. The decay time resolution for 
these decays is found to be 40fs. 


With the particle identification capability, an efficient flavour tag can be obtained using 
the charged kaons from the b hadron decays. No lepton is required in the analysis for 
flavour tagging. Therefore, the Level-0 high pr hadron trigger increases significantly the 
statistics of the flavour-tagged sample of pure hadronic B decay final states, compared to 
using the high pr lepton trigger alone. Table 1 summarises the Level-0 trigger efficiencies 
for various decay modes. Efficiencies are calculated for those events where the initial 


Decay Mode Level-0 high-pr Level-0 
muon electron hadron | all combined 

B® 2 J/i(e*e-)Ks | 0.17 0.63 0.17 0.72 

B? > Jléb(u*u-)Ks | 0.87 0.06 0.16 0.88 

B, > D} K- 0.15 0.09 0.45 0.54 

B9 — ntr” 0.14 0.08 0.70 0.76 


Table 1. Level-0 trigger efficiencies for reconstructed and flavour tagged final states. 


flavour is identified and the final state is fully reconstructed with all the cuts applied to 
remove background. While J/ŅKs final states are mainly triggered by the muon and 
electron high-pr triggers, the hadron high-pr trigger is essential for the other final states. 


The table also indicates that the Level-0 trigger efficiencies are very high for the 
events that are useful in the analysis. As a result, the LHCb experiment will run with 
a luminosity of 2 x 10?cm ?s^! and still collect 2.4k reconstructed and flavour-tagged 
B, and B, decays into D? K^ and D; K+ in one year. With these statistics, y can be 
measured with an accuracy of ~10°. By combining the result from the decays of B? and 
B into D**z- and D*-7*, y will be measured with a precision of better than 7? with 
one year of data taking. 


At a low luminosity, the bunch interactions are dominated by events with only one pp 
collision. The running luminosity will be locally tuned at the LHCb intersection such that 
the experiment can run with this optimal luminosity while the other LHC experiments 
run at the design luminosity. It must be noted that running at lower luminosities has the 
additional benefit that the radiation damage to the detector is reduced. 


6 Conclusions 


CP violation outside the neutral kaon system has been observed for the first time by 
BABAR and BELLE experiments, in the decays of B^ and B® into the J/y Ks final state. 
Current analysis shows that all the measurements related to 6 hadron decays are consistent 
with the CKM picture, including CP violation. Although BABAR, BELLE, CDF and DO 
will further enhance these sttidies in the near future, a new generation of experiments at 
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the LHC is needed in order to examine the CKM picture in a model-independent way. It 
should be noted that a dedicated experiment at the Tevatron (Kulyavtsev 2000) would 
have a similar performance. LHCb is a dedicated detector for B physics at the LHC with 
particle identification capability and a trigger sensitive to both leptonic and hadronic final 
states, which should be able to study CP violation in both B? and B? meson systems 
in many decay modes. Some of those decay modes are essential to determine the CKM 
parameter in a theoretically clean and model-independent way. This will allow the effect 
of possible new physics to be identified unambiguously. 
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